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The present invention relates to a DNA sequence, a combination of DNA sequences, a vector, a 
microorganism, a method for producing xylose reductase and/or xylitol dehydrogenase, xylose reductase 
and xylitol dehydrogenase; the invention further relates to an ethanol manufacturing process, a process for 
production of biomass, a process for recycling of NADP from NADPH and a method for producing a 
5 desired protein in Pichia stipitis. 

D-xylose is one of the most abundant carbohydrates occurring in plant biomass and wood. In the 
process of cellulose production, it is formed as a waste product from hydrolysis of xylan, which is the major 
compound of hemicellulose. To optimize the use of renewable carbon sources, it is desirabiejo convert 
xylose into ethanol or biomass. There are several yeast species, such as Candida {Gong et al M 1981, 
w Jeffries, 1983), Debaryomyces. Hansenula, Kluyveromyces , Metschnikowia , Pachysolen , Paecilomyces (Wu 
et al., 1986) and Pichia (Maleszka and Schneider 1982), which are able to utilize pentoses, including D- 
xylose, and D-ribose, however, only aerobically. 

In general, pentoses utilized by yeasts (e.g. Pichia stipitis ) must be isomerized to pentuloses in order to 
be phosphorylated. This isomerization occurs via a NAD(P)H linked reduction (reductase) to pentitols 
75 followed by NAD*-Iinked oxidation (dehydrogenase) of the pentitols to the corresponding D-pentuloses 
(Barnett, 1976). The yeast mainly used in bioethanol production, S. cerevisiae , can utilize xylulose, however, 
this yeast is not able to ferment pentoses (Jeffries. 1988). It cannot be excluded, that S. cerevisiae also 
contains genes, coding for pentose fermenting proteins which however are not expressed. 

Pentose fermentation by S. cerevisiae may be possible by providing a xylose utilising pathway from a 
20 xylose metabolizing organismTRowever, although many attempts have been undertaken to express bacterial 
xylose isomerase genes in S. cerevisiae, no xylose fermentation could be obtained probably due to 
inefficient expression of the foreign gene (Sarthy et al., 1987, Amore et al.. 1989, Chan et al., 1986 & 1989). 

Therefore it is a primary object of the present invention to provide genes of the enzymes involved in 
xylose degradation in order to be able to manipulate these genes, for example to combine these sequences 
25 with suitable regulating sequences. 

This object has been solved by a DNA sequence comprising a structural gene coding for xylose 
reductase and/or xylitol dehydrogenase and being capable of expressing said polypeptide(s) in a microor- 
ganism. 

Further objects of the present invention will become apparent by the following detailed description of 
30 the invention, the examples and figures. 

Throughout this application various publications are referenced by the first author within parenthesis. 
Full citations of these references may be found at the end of the specification as an annex. The 
disclosures of these publications in their entireties are hereby incorporated by reference into this application 
in order to more fully describe the state of the art as known to those skilled therein as of the date of the 
35 invention described and claimed herein. 

The DNA sequences according to the present invention preferably are derived from a yeast. Preferred 
yeast strains are selected from the genera Schwanniomyces , Saccharomyces , Kluyveromyces , Pichia, 
Hansenula, Candida, Debaryomyces , Metschnikowia , Pachysolen and Paecilomyces . All of these yeast 
genera are known to be able to convert xylose into ethanol using xylose reductase and xylitol de- 
40 hydrogenase. 

A preferred genus used as a source for the DNA sequence according to the present invention is the 
yeast Pichia . This genus comprises several species, any of which could be applied for performing the 
present invention. However, the preferred species is Pichia stipitis . The present inventors used Pichia 
stipitis CBS5773 for isolation of the DNA sequences comprising a structural gene coding for xylose 

45 reductase and/or xylitol dehydrogenase. Pichia stipitis CBS5773 was redeposited under the Budapest 
Treaty on March 21, 1990 (DSM 5855). 

The present inventors succeeded to isolate DNA molecules containing a sequence comprising the 
structural gene encoding a xylose reductase and a xylitol dehydrogenase respectively. By way of the DNA 
sequence, which was determined according to standard procedures, the amino acid sequence of both these 

50 proteins could be determined for the first time. The complete amino acid sequences as well as the 
nucleotides sequences of both these proteins are shown in Rgures 2A and 2B. As is known to everybody 
skilled in the art the proteins having the amino acid sequences as shown in Rgures 2A and 2B can be 
encoded not only by the DNA sequences as found in Pichia stipitis CBS5773, but also by using alternative 
codons provided by the degeneracy of the genetic code. The invention thus is not limit d to the DNA 

55 sequence as shown in Rgure 2, but also comprises any modification yielding the sam amino acid 
sequences. 

The DNA sequences according to the present invention may not only be obtained by applying the 
methods shown below, i.e., by isolating cDNA clones, which further on are used to screen a genomic 
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library, but also may be obtained by other methods of recombinant DNA technology from either natural 
DNA or cDNA or chemically synthesized DNA or by a combination of two or more of these DNAs. For 
example, it may be attempted to combine a chemically synthesized 5' region with a cDNA coding for the 3' 
region or any other combination of the three DNA sources mentioned above. 
5 According to the present invention there are also provided combinations of DNA sequences, which 
comprise a DNA sequence as discussed above, i.e., a sequence comprising a structural gene coding for a 
xylose reductase and/or xylitol dehydrogenase, and in addition one or more DNA sequences capable of 
regulating the expression of the structural genes mentioned above in a presumptive host microorganism. 
DNA sequences capable of regulating the expression of structural genes are well known to those skilled in 
10 the art. For example, the DNA sequences discussed above may be combined with promoters, which are 
connected with the structural genes in order to provide efficient expression. Further DNA sequences 
capable of regulating the expression may comprise enhancers, termination sequences and polyadenylation 
signals. Examples for the best known kind of regulating sequences, are shown by the following examples. 
In order to express the DNA sequences and/or the combination of DNA sequences according to the 
is present invention efficiently, small modifications of the DNA sequences may be performed, as long as their 
capability to express a functional enzyme having the desired xylose reductase or xylitol dehydrogenase 
activity is retained. These modifications may include either variations of the genetic code as discussed 
above or furthermore small substitutions of the amino acid sequence, as well as deletions and/or insertions, 
which do not have any detrimental impact on the respective enzyme activity. 
20 In a preferred embodiment the DNA sequence, capable of regulating the expression of the structural 
gene, is derived from an endogenous gene of the microorganism, in which expression of the DNA sequence 
is intended. Since, as will be shown below in more detail, Saccharomyces cerevisiae is one of the preferred 
microorganism to be used in the present invention, there are a multitude of possible regulating sequences 
known. Some of these well-known sequences have been used to construct expression vectors, as will be 
25 shown below in the examples. In the most preferred embodiment the combination of DNA sequences 
comprises inducible promoters. In this case the expression of xylose reductase and xylitol dehydrogenase 
can be prevented, as long as desired; expression may be started upon addition of a suitable inducer. 

In the most preferred embodiments of the present invention the following Saccharomyces cerevisiae 
promoters are used to regulate the expression of the genes encoding xylose reductase and/or xylitol 
30 dehydrogenase: ADH1 , ADH2, PDC , GAL1/10. 

Depending on the choice of the respective promoter it may be possible to obtain expression levels 
exceeding that of natural expression of both proteins in their original host organism. 

The DNA sequences as well as the combinations of the DNA sequences according to the present 
invention may be introduced in vector molecules. These molecules may be plasmids, which are suitable for 
35 replication in the desired host microorganism and thus should contain a functional origin of replication. 
Alternatively, it is also possible, to use linear DNA fragments carrying the DNA sequence or combination of 
DNA sequences according to the present invention or to use circular DNA molecules being devoid of a 
functional origin of replication. In this case the vector, which is not capable of replication, will be inserted by 
either homologous or nonhomologous recombination into the host chromosome. 
40 Subject of the present invention are further microorganisms, which have received DNA sequences 
comprising the inventive DNA sequences or combinations of DNA sequences coding for xylose reductase 
or xylitol dehydrogenase by recombinant DNA technology. 

Preferred microorganisms are selected from a group consisting of yeast of the genera Saccharomyces, 
Schizosaccharomyces, Schwanniomyces , Kluyveromyces, Pichia , Hansenula , Candida , Debaryomyces , 
45 Metschnikowia, Pachysolen or Paecilomyces or bacteria of the genus Zymomonas . 

From these organisms the most preferred microorganisms are Saccharomyces cerevisiae and 
Schizosaccharomyces pombe and Zymomonas . 

One of the possible applications of the genetically altered yeast strains described above is the 
production of biomass. Since the yeast strains having acquired the ability of expressing xylose reductase 
so and/or xylitol dehydrogenase are maintaining good fermentation abilities, biomass can be produced most 
efficiently by use of these inventive yeast strains. The methods for producing biomass are the usual ones, 
which are known to everybody skilled in the art. The genetically manipulated yeast strains provided in 
compliance with this invention are also suitable for the production of ethanol. The preferred organisms for 
use in the production of ethanol by fermentation are the yeasts Saccharomyces cerevisia and/or 
55 Schizosaccharomyces pombe and/or the bacterium Zymomonas. 

The preferred carbohydrate in the ethanol production is xylose. Thus, strains of Saccharomyces 
cerevisiae and/or Schizosaccharomyces pombe and/or Zymomonas being able to ferment xylose are highly 
advantageous in the production of ethanol. The production of potable spirit or industrial ethanol by use of a 
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genetically manipulated yeast strain according to the present invention can be carried out in a manner 
known per se. The inventive yeast strains have the ability to ferment concentrated carbohydrate solutions, 
exhibit higrTethanol tolerance and have the ability of producing elevated concentrations of ethanol; they 
have a high cell viability for repeated recycling and exhibit remarkable pH-and temperature tolerance. In the 

5 process of xylose production xylose is formed as a waste product from hydrolysis of xylan, which is the 
major compound of hemicellulose. Hence it is of great advantage to use xylose for the production of ethanol 
and/or biomass. The invention is further suitable for the production and isolation of the NAD(P)H linked 
xylose reductase. Due to the reduction reaction this enzyme is suitable for the delivering or recycling (from 
NADPH to an NADP) of the corresponding coenzyme especially in bioreactors. for example for the 

10 production of amino acids. 

A further subject of the present invention is a method for producing the xylose reductase and/or xylitol 
dehydrogenase by cultivating a microorganism according to the present invention under suitable conditions 
and recovering said enzyme or both of them in a manner known per se. The method thus includes the 
expression of a DNA sequence or a combination of DNA sequences according to the present invention in a 

T5 suitable microorganism, cultivating said microorganism under appropriate conditions and isolating the 
enzyme. 

It could be shown, that the level of expression of desired proteins in the inventive microorganisms is 
enhanced, if the microorganism has been selected for efficient fermentation of xylulose. Thus, is is 
preferred, to perform the method for reproducing one or both of the proteins using microorganisms, which 

20 have been selected accordingly. 

Since the present invention provides the cloned genes and the corresponding sequences, the gene 
products can be overproduced in other organisms, e.g. in yeasts of the genera Saccharomyces, Schizosac- 
charomyces, Schwanniomyces, Kluyveromyces, Pichia, Hansenula . Candida , Debaryomyces . Metsch- 
nikowia, Pachysolen, Paecilomyces or bacteria of the genus Zymomonas . The techniques employed for 

25 obtaining expression of the XYL1 (xylose reductase) and/or XYL2 (xylitol dehydrogenase) gene and the 
isolation of the active gene product are the usual ones such as promoter-fusion, transformation, integration 
and selection, and methods of protein isolation, known by the man skilled in the art. 

Generally, said microorganisms have received the DNA sequence or combination of DNA sequences 
via transformation procedures. For each of the possible microorganisms, i.e. the different yeast genera and 

30 bacteria of the genus Zymomonas, there are transformation procedures known. The transformation is 
preferably carried out using a vector, which may be either a linear or circular DNA molecule; in addition, the 
method can be performed using autonomously replicating or integrative molecules as well. In the case, that 
the molecule is supposed to integrate into the genome of the respective host, it is preferred, to use a vector 
containing DNA, which is homologous to the DNA of said intended host microorganism. This measure 

35 facilitates homologous recombination. 

Further subjects of the present invention are the enzymes produced according to the above described 
method. 

The microorganisms according to the present invention may be used in ethanol manufacturing 
processes. Since xylose is a readily available source, which normally is considered to be waste, the ethanol 

40 manufacturing process according to the present invention provides a possibility for ethanol production of 
high economical and ecological interest. 

The ethanol manufacturing process may be adapted for the production of alcoholic beverages or single 
cell protein from substrates containing free xylose, which is preferably released by xylanase and/or 
xylosidase activity from xylan. 

45 According to the present invention there is further provided a method for the production of a desired 
protein in Pichia stipitis. According to this method a structural gene coding for a desired protein is 
expressed under control of the 5* regulating region of the XYL1 and/or XYL2 gene from Pichia stipitis and/or 
the ADH1 promoter of S. cerevisiae and/or the glucoamylase promoter from Schwanniomyce s occidentalis. 
Out of the promoters mentioned before use of the 5* regulating regions of the XYL1 or XYL2 genes is 

so preferred, because these promoters may be induced by adding xylose. Pichia stipitis , when used as a host 
organism, exhibits the great advantage of having an efficient secretion system. This facilitates an efficient 
expression not only of proteins, which stay inside the cell, but also of proteins, which are continuously 
secreted into the medium. A further advantage of the Pichi a stipitis expression system is the possibility of 
using xylose as a substrate. Xylos is a rather inexpensive, readily available nutri nt. 

55 The invention will be discussed in detail by way of the following figures and examples. 

BRIEF DESCRIPTION OF THE FIGURES: 
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Fig. 1 

A: restriction map of the DNA fragment encoding the xylose reductase gene (XYL1) 
E: EcoR1, H: Hindlll, B: Bam HI, N: Nco l, 
P: Pvull, Ps: Psti 

B: restriction map of the DNA fragment encoding the xylitol dehydrogenase gene ( XYL2 ) 
Ba: BamHI. B: Bg1 II, E: EcoR1 , X: Xbal, S: Sail 
Fig. 2 

A) Nucleotide sequence of the XYL1 structural gene including its 5'- and 3 f -flanking sequences and 
the corresponding amino acid sequence. 

B) Nucleotide sequence of the XYL2 structural gene including its 5 1 - and 3'-flanking sequences and 
the corresponding amino acid sequence. 

Fig. 3 S. cerevisiae and S. pombe expression vector. Plasmid pRD1 contains both the xylose reductase 
gene and xylitol dehydrogenase gene under control of their original promoters. 

Fig. 4 Fermentation curve of PK4 grown in YNB, 2% xylose medium. The culture was inoculated with 
10 s cells/ml from a xylose grown preculture. The figure shows xylose consumption and conversion into 
ethanol with a theoretical maximum yield. 

Fig. 5 (1,2) Construction scheme for constructing the vector pBRPGAM. For constructing this vector, the 
3.8 kb EcoRI-Pvull-fragment from pBRSwARSGAM containing the functional GAM promoter and base 
pairs 1 to~208 oflhe coding GAM sequence was ligated to the small EcoRI-Pvull-fragment of pBR322. 
Fig. 6 (1,2) Construction scheme for constructing the vector pBRGCL For constructing this vector, the 
3.4 kb Pvull-fragment of pCT603 containing the structural gene for xylose starting with nucleotide + 122 
was ins~erted into the Pvull site of vector pBRPGAM. 

Fig. 7 (1,2) ConstructiorTscheme for constructing the vector pMPGCl-2. The 6.5 kb Bam HI-Pstl-fragment 
of pBRGCI containing the cellulase gene under control of the GAM promoter was ligated with the large 
Bam HI-Pstl-fragment of pCJD5-1. 

EXAMPLES 

Materials and Methods 

I. Microorganisms and cultivation 

Yeast strains: 

1 . S. cerevisiae: 

a) XJB3-1B (MAT a, met6, gal2) was obtained from the Yeast Genetic Stock Center (see Catalogue 
of the Yeast Genetic Stock Center, 6. edition, 1987). 

b) GRF18 (MAT a, leu2-3, leu2-112, his3-11, his3-15) was obtained from G.R. Fink (DSM 3796). 

c) AH22 (MATa, cant, his4-519, teu2-3, Ieu2-112) was obtained from A. Hinnen (DSM 3820). 

2. Schizosaccharomyces pombe (leu1-32, his5-303) (DSM 3796). 

3. p. stipitis CBS5773 (DSM 5855) was obtained from Centraaibureau voor Schimmelcultures, Yeast 
Division, Delft, The Netherlands. 

Yeast strains were grown at 30°C in YP medium (1% yeast extract. 2% bacto pepton) or in 0.67% Difco 
yeast nitrogen base (YNB) without amino acids, optionally supplemented with appropriate amino acids. 
Media were supplied with either 2% xylose or 2% glucose. The yeasts were transformed according to 
Dohmenetal. (1989). 
E. coli strains: 

~~ TTDHSaP (supplied by BRL company, Eggenstein, FRG) 
2. HB101 (DSM 3788) (Bolivar et al., 1977). 

E. coli strains were grown at 37°C in rich medium (LB-medium, Maniatis et al., 1982). The medium 
wasTupplemented wfth penicillin G (100 jig/ml) when selecting for transformants. E. coli transforma- 
tion was carried out as described by Maniatis (1982). 

II. Purification of the XR and XDH proteins from P. stipitis 

Cells were grown under induced conditions to exponential growth phase. To prepare cell-free 
extracts ceils were harvested by centrifugation and were broken with glass beads in a Braun homogen- 
izer using 0.1 M Tris-HCI buffer (pH 7.0). The supernatant obtained following 1 h centrifugation of th 
crude extract (150000 x g) was loaded on an affinity chromatography column (Affi-Gel Blue, 60x50 mm) 
preequilibrated with 5 mM NaPO* buffer (pH 6.8) and eluted with 1.5 mM NAD. The fractions containing 
XR and XDH activity were pooled and dialysed against 20 mM Tris-HCI (pH 7.5). The dialysate was 
subsequently applied to a DEAE-Sephacel anion exchange column preequilibrated with 20 mM Tris-HCI 
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(pH 7.5). Proteins were eluted with a linear gradient (20-250 mM Tris-HCI, pH 7.5). Fractions containing 
the highest activity were pooled, concentrated and loaded on a SDS-PAA-gel. After running the gel was 
stained with 0.1 M KCI and the XR- and XDH-proteinbands were cut out, both proteins were separately 
eluted from the polyacry (amide gel by dialysis using 20 mM NaP04 (pH 8.0), 0.1% SDS; subsequently 
5 the dialysate was concentrated. All buffers contained 0.2 mM DTT (Dithiothreitol) and 0.4 mM PMSF 
(Phenylmethansulfonylfluoride). 

III. Preparation of antisera 

Mice were given intraperitoneal injections of 2-5 ^g protein in Freund complete adjuvant Two weeks 
later the same amount of protein in Freund incomplete adjuvant was injected; a third injection was 
to administered another 2 weeks later omitting Freund adjuvant. Antiserum was harvested six weeks after 
the first injection. 

IV. Immunoscreening 

Antisera raised in mice against purified P. stipitis xylose reductase (XR) and xylitoi dehydrogenase 
(XDH) protein, respectively, were used for screening the cDNA library following the procedure of Huynh 
75 et al. (1985). The antisera were diluted 10.000-fold. Bound antibodies were visualized using an alkaline 
phosphatase-conjugated goat anti-mouse immunoglobulin antibody, followed by a colour development 
reaction with the phosphatase substrate 5-bromo-4-chloro-3-indolyl phosphate (BCIP) in combination with 
nitro blue tetrazolium (NBT). 

V. Isolation of RNA 

20 All procedures were carried out at 0 to 4°C, if not indicated otherwise. All solutions and materials 

were sterilized if possible. P. stipitis cells were grown to midexponential phase in the presence of xylose. 
Yeast cells were harvestecTby centrifugation, washed twice with buffer 1 (20 mM NaCI, 10 mM MgCfe. 
100 mM Tris-HCI, ph 7.6) and suspended in the same buffer (1.25 ml/g cells). 1/10 volume phenol, 200 
nQlm\ heparin, 100 jig/ml cycloheximid and 0.4% SDS were added. Disruption of the cells was carried 

25 out by shaking with glass beads (0.45 - 0.5 mm) in a ratio of glass beads to suspension of 1 :1 (v/v) in a 
Braun homogenizer (Braun, Melsungen). Two volumes of buffer 2 (buffer 1 containing 100 jig/ml heparin, 
50 jig/ml cycloheximid, 2% SDS) were added to the homogenate, cell debris were removed by 
centrifugation (10000 x g, 10 min). The solution was extracted three to five times with phenol/chloroform 
(1:1), once with chloroform/ isoamylalcohol (24:1). The nucleic acid was precipitated by incubating the 

30 aqueous phase with 2,5 volume of ethanol in the presence of 0.2 M NaCI over night at -20°C. The 
precipitate was solubilized in buffer 3 (0.1 M NaCI, 1 mM EDTA, 10 mM Tris-HCI. pH 7.5); SDS and UCI 
were added to a final concentration of 0.1% and 4 M, respectively. The RNA was precipitated over night 
at +4°C. The pellet was washed twice with 70% ethanol and suspended in sterilized H2O before use. 
RNA was stored at -70°C as an ethanol precipitate. 

35 VI. Enzyme assays 

Activities of xylose reductase (EC. 1.1.1.21) and xylitoi dehydrogenase (EC. 1.1.1.9) were measured 
as described by Bruinenberg et al. (1983). Protein was determined with the micro biuret method 
according to Zamenhoff (1957) using bovine serum albumin as standard. 

VII. Gelelectrophoresis 

40 SDS gelelectrophoresis was carried out in 10% PAA according to Laemmli (1970). 

VIII. Immunoblotting 

Detection of antigenic proteins was carried out as described by Towbin et al. (1979) using the 

antisera obtained from mice. The proteins were transferred to a polyvinylidene difluoride microporous 

membran (Miilipore. Immobilon PVDF) and were visualized by a phosphatase-coupled colour reaction 
45 (Blake et al., 1984). Alkaline phosphatase conjugated to goat anti-mouse IgG was obtained from Jackson 

Immunoresearch Lab. (Avondale, USA). 

DC. DNA-sequence analysis 

XYL1 and XYL2 genomic DNA as well as the respective cDNAs were subcloned in pT7T3-18U 

(Pharmacia). Fragments obtained by partial digestion using Exonuclease HI (Henikoff, 1984) were 
50 analysed and sequencing was carried out by the dideoxy method of Sanger et al. (1977) using the T7- 

SequencingTMkit (Pharmacia). Both strands were completely determined by obtaining overlapping 

sequences at every junction. 

X. Construction of a P.stipitis CBS 5773 (DSM 5855) cDNA library 

Total RNA was extracted according to the method described above. Poly (A) -RNA was prepared by 
55 chromatography on an oligo(dT)-c llulos column using essentially the method described by Maniatis et 
al. (1982). A cDNA library in \gt11 was prepared by the method of Gubler and Hoffman (1983) using a 
cDNA synthesis kit (Pharmacia) and in vitro packaging of the recombinant XgtH-DNA according to Hohn 
and Murray (1974) using th in vitro packaging kit supplied by Boehringer, Mannheim (FRG). 
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XI. Preparation of crud extracts 

Cells were grown to late exponential growth phase and washed twic in buffer (10 mM potassium 
phosphate, pH 7.0, 1 mM EDTA, 5 mM £-mercapto ethanol). Cells were broken in an Braun homogenizer 
with an equal volume of glass beads. The supernatant resulting from 5 min centrifugation at 10000 g was 
used in enzyme assays. Extracts for Western blot analysis were boiled in 1% SDS, 5% 0-mercapto 
ethanol, 10 mM potassium phosphate pH 7.0 and 10% glycerol. 
EXAMPLE 1: 

Isolation of the xylose reductase ( XYL1 ) and xylitol dehydrogenase ( XYL2 ) genes. 

A \gt11 cDNA library constructed from poly (A)*-RNA of P. stipitis was screened with mouse 
polyclonal antibodies raised against the purified xylose reductase (XR) and xylitol dehydrogenase (XDH) 
proteins, respectively. Among 110.000 recombinant clones of the amplified cDNA library containing 
about 55.000 primary clones, seven identical . XYL1 clones and three identical XYL2 clones were 
identified and purified. The analysis of the insert size revealed that the XYL1 clones contain two EcoR1 
fragments (0.6 kb and 0.4 kb), whereas the XYL2 clones contain a single 0.55 kb EcoR1 fragment. The 
respective EcoR1 fragments of the Xgtl 1 clones were subcloned into the single EcoR1 site of plasmid 
pT7T3-18U~(Pnarmacia) resulting in plasmids pXRa (containing the 0.4 kb EcoR1 fragment of the XYL1 
clone). pXRb (containing the 0.6 kb EcoR1 fragment of the XYL1 clone) and pXDH (containing the 0.55 
kb EcoR1 fragment of the XYL2 clone). 

"These plasmids were used as a radioactive probe to screen a P. stipitis genomic library, which was 
constructed by ligation of partial Sau3A digested P. stipitis DNA into the single BamH1 site of the S. 
cerevisiae - E. coli shuttle vector~^Ep13 (Broach et al., 1979) resulting in about 60.000 independent 
clones after tfanslormation of E. coli HB101, 

Two plasmids, namely pR1 and pD1 could be isolated and were used for transformation of Sx 
cerevisiae GRF18. XR activity could be detected in the crude extracts of the transformants containing 
pR1, whereas transformants carrying pD1 yielded crude extracts exhibiting XDH activity. In a mitotic 
stability test (Beggs 1978) the LEU2 marker and the XR or XDH gene cosegregated, indicating that pR1 
and pD1 harbour the functionaTXYU (xylose reductase) and XYL2 (xylitol dehydrogenase) gene, 
respectively. 

The plasmids pR1 and pD1 were subjected to restriction enzyme analysis yielding the map of 
restriction sites of the XYL1 (Fig. 1 A) and XYL2 (Fig. 1B) genes, respectively. 

Further subcloning experiments revealed that the XYL1 gene is encoded within a 2.04 kb Bam HI 
genomic fragment. One of the Bam HI sites is not present in the original plasmid pR1. It must have been 
generated during subcloning. fhe~XYL2 gene is encoded within a 1.95 kb Bam HI-Xbal fragment. The 
2.04 BamHI fragment and the 1 .95 kb BamHI-Xbal fragment were subcloned into the multiple cloning site 
of pT7T3-18U resulting in pR2 and pD2. respectively, and subjected to DNA sequence analysis. The 
DNA sequence of the structural gene and of the 5' and 3' non-coding region of the XYL1 and the XYL2 
gene is depicted in Rg. 2A and Fig. 2B, respectively. 

The DNA sequence of the XYL1 gene contains an open reading frames of 954 bp (318 amino acids) 
whereas that of the XYL2 gene comprises an ORE of 1089 bp (363 amino acids). 

The amino acidilieduced from the open reading frames are shown in Fig. 2A and Fig. 2B. The 
sequences correspond to an XR polypeptide and an XDH polypeptide with a calculated molecular weight 
of 35922 and 38526 D, respectively. 
EXAMPLE 2 

Expression of both the xylose reductase and xylitol dehydrogenase gene in S.cerevisiae. 

Saccharomyces cerevisiae was contransformed with pR1 and pD1. The highest measurable activities 
of XR and XDH in S. cerevisiae transformed accordingly correspond to 50% of the activities of both 
enzymes measurable^ P. stipitis crude extracts. In S. cerevisiae the genes were expressed in YNB 
medium containing 2% glucose as a sole carbon source, whereas in P. stipitis expression of both genes 
is repressed by glucose and induced by xylose. Taking into account the copy number of 10 of YEp13 in 
S. cerevisiae and assuming a gene dosage dependent expression one can conclude that the Pichia 
promoters are 20 times less efficient in S. cerevisiae than in P. stipitis. 

Furthermore, a plasmid harbouring~both the XYL1 and XYL2 gene including their original Pichia 
promoters was constructed (Rg. 3). This plasmid pRD1 was used to transform strain GRF18 by selection 
on leucine resulting in the transformant PK1. However, expression was not improved compared to 
cotransformation with separate plasmids. 
EXAMPLE 3 

Construction of an Int grativ vector containing the XYL2 g ne under control of different 
promot rs 
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Different expression vectors using different promoters for integration and gene expression in S. 
cerevisiae were constructed. For example the XYL2 gene was fused to the ADH1 promoter followed by 
homologous integration into the HIS3 locus of S. cerevisiae . The strategy employed was as follows: The 
1 .5 kb Xbal/ EcoR1 fragment containing the xylitol dehydrogenase gene XYL2 was inserted into the 

5 multiple cloning site of pT7T3-18U (Pharmacia) resulting in plasmid pXDH. To eliminate the promoter 
region of the XYL2 gene this plasmid was linearized with Xbal (restriction site 318 bp upstream of the 
initiator ATG codon) and with Pstl to protect the 3' end of the plasmid DNA. The linear plasmid was 
treated with exonuclease III ancTiubsequently with S1 nuclease to remove the DNA between the Xba l 
site and the XYL2 structural gene. The deleted DNA molecules were recircularized, cloned in E. coli and 

io the extent of deletion was determined by dideoxy sequencing. In one of the modified pXDH plasmids the 
5' untranslated region and the four N-terminal amino acids were deleted. However, a new inframe ATG 
initiation codon was created due to the Sph l site from the multiple cloning site. A Bam Hl linker was 
inserted into the Hindlll site of the multiple~cloning site. Subsequently, a 1 .5 kb Bam Hl fragment carrying 
the XYL2 gene could be subcloned into vector pT7T3-18U resulting in additional restriction sites in front 

T5 of the ATG initiation codon. The newly created 5' region is as follows: ATE CCT TGG TGT...(deletion of 
original amino acid 2,3 and 4). 

To complete the 3' untranslated region of the XYL2 gene a 440 bp Eco RI fragment, was inserted into 
the single EcoRI site of the 1.5 kb fragment subcloned in pTTTS-iSU. This 440 bp fragment was 
obtained by~subcloning the 440 bp EcoRI - Bam Hl fragments (see Fig. 1B) into another pT7T3-18U, 

20 removing the BamHl site by cutting wlth ~Bam HI and subsequent fii!ing-in with Klenow polymerase. The 
3' untranslated region could thus be isolated as 440 bp EcoRI fragment. In the single Bam Hl site 
arranged near the 5' terminus of the XYL2 gene, which is provided by the polylinker region, the 1.8 kb 
Bam Hl fragment harbouring the S. cerevisiae His3 gene derived from plasmid YEp6 (Struhl et al. 1979) 
was inserted. To remove one of "the two Bam Hl sites the resultant plasmid was cut with Sail and Xho l 

25 and subsequently recircularized. The resulting plasmid pXDH-HIS3 contains one suitable Bam Hl site in 
front of the ATG initiation codon in which the 1.5 kb Bam Hl fragment, containing the AD H1 - promoter 
(Ammerer, 1983) of S. cerevisiae can be inserted. 

Since this plasmfd does not contain any autonomous replicating sequence for S. cerevisiae this 
plasmid can be used for homologous integration (Orr- Weaver et al. 1981) into the HIS3 locus of any S. 

30 cerevisiae strain. 

In our integration experiments we used a mutagenized XJB3-1B strain called PUA6-1, which was 
isolated according the protocol of Porep, (1987) and Ciriacy, (1986). The resulting integrant PK2 is 
expressing the XYL2 gene under control of the ADH1 promoter leading to an active gene product. 
EXAMPLE 4 

35 Construction of S.cerevisiae and S.pombe integrants expressing both the XYL1 and XYL2 gene. 

To eliminate the promoter egion of the XYL1 gene plasmid pR2 containing the XYL1 gene on a 2,04 
kb BamHl fragment was linearised with Xbal ( restriction site 362 bp upstream of the translation initiation 
ATG codon) and cleaved with Sph l to protect the 3' end of the plasmid DNA. The linear plasmid was 
treated with exonuclease III and subsequently with S1 nuclease to remove the DNA between the Xba l 

40 site and the XYL1 structural gene. The deleted DNA molecules were recircularized, cloned in E. coli and 
the extent of deletion was determined by dideoxy sequencing. In one of the modified pR2 plasmids the 
5* untranslated region was exactly deleted. 

The structural gene was subcloned as a Hindlll- Bam HI fragment into the corresponding sites of 
Ylp366 (Myers et al. 1986). In addition the ADHT~promoter was subcloned into the Hind lll site by blunt 

45 end ligation resulting in plasmid pXR-LEU2. Since this plasmid does not contain any autonomous 
replicating sequence for S. cerevisiae this plasmid can be used for homologous integration (Orr-Weaver 
et al. 1981) into the LEU2~Tocus of any S. cerevisiae strain, e.g. strain PK2. The resulting integrant PK3 is 
expressing both the XYL1 and XYL2 genes under control of the ADH1 promoter leading to active gene 
products. For expression studies in Schizosaccharomyces, S. pombe was transformed with both 

50 plasmids pXDH-HIS3 and pXR-LEU2 selecting for histidine and leucine. After extensive screening of the 
transformants for growth on xylose one transformant called AS1 could be isolated expressing both the 
XYL1 and XYL2 gene under control of the ADH1 promoters. 

In the same manner other S. cerevisiae promoters, e.g. pyruvate decarboxylase ( PDC) promoter 
(Kellermann & Hollenberg, 1 988)ralcoholdehydrogenas 2 (ADH2) promoter (Russell et al., 1983) or th 

55 galactokinase ( GAL1/10) promoter from plasmid pBM272, which is derived from plasmid pBM150 
(Johnston and Davis, 1984) by introducing a Hindlll site immediately following the Bam Hl site, led to 
expression of active XYL1 and XYL2 gene product in S. cerevisiae . 

In another set of experiments two suitable restriction sites Bam Hl (position -9) and Sail (position - 15) 



8 



EP 0 450 430 A2 



were introduced just in front of the XYL1 and XYL2 genes. 
XYL1 :5'attcttttcta GTCGAC GGATCCAAGATGCCTTCTATT 
...TAA terminator3' 

XYL2:5 T cccctaatact GTCGAC GGATCCAAGATGACTGCTAAC 
...TAA terminator3' 

These modifiations were introduced by site directed mutagenesis of the 5* region using the site directed 
mutagenesis kit supplied by Amersham according to the instructions of the manufacturer. These 
restriction sites offer the possibility to fuse any promoter just in front of the ATG initiation codon. In 
addition the gene under control of a desired promoter can be isolated as a well defined fragment for 
insertion into sequences suitable for homologous integration. 

For industrial or commercial purposes it is desirable to construct stable production strains of S. 
cerevisiae and/or S. pombe. Therefore both genes under control of the constitutive ADH1 promoter were 
integrated withouFany bacterial sequence into the chromosome of S. cerevisiae strain PUA6-1 via 
homologous integration (Orr- Weaver et al.1981). Integration into the HO homothallism gene (Russel et aJ. 
1986), ARS-sequence (Stinchcomb et al., 1978) or into the ADH4 gene (Paquin et al., 1986) by 
cotransformation with pJW6 (Fogel and Welch. 1982) is preferred resulting in strains PK3(HO), PK3(ARS) 
and PK3(ADH4). In the case of S. pombe the integration mainly occurs via illegitimate recombination. 
Hence only a few of the S. pombe integrants exhibit XR and XDH activities and have the same 
fermentation and growth properties as the wild type. 

The S. cerevisiae integrants PK3, PK3(HO). PK3(ARS) and PK3(ADH4) may be improved for efficient 
assimilation of xylulose. 
EXAMPLE 5 

Isolation of a S.cerevisiae mutant efficiently assimilating xylulose. 

S. cerevisiae strain XJB3-1B grows slowly on media containing xylulose as a sole carbon source 
(doubling time 10 hours). According to a protocol described by Porep (Porep, 1987) a mutant, PUA3, 
was isolated, which utilized xylulose more efficiently than wild type S. cerevisiae strains, resulting in a 
doubling time of approximately four hours for growth on xylulose as a sole carbon source. 

Mutant strain PUA3 also converts xylulose into ethanot in the absence of respiration (Porep. 1987). In 
order to obtain the PUA genotype in combination with an auxiliary marker (LEU2) useful in yeast 
transformation, strain PUA3 was crossed to AH22 (leu2 h|s4). F rom a sporulating culture of the 
AH22xPUA3 diploid meiotic spore progenies were isolated which were Ieu2 and had the ability of 
efficient xylulose-utilization as observed in the original mutant. PUA3. In an analogous experiment the 
PUA genotype was combined with Ieu2 and his3 auxiliary markers by crossing strain GRF18 and PUA 
strain and subsequent meiotic spore isolation. This resulted in strain PUA6-1 which was PUA Ieu2 his3 . 
EXAMPLE 6 

Isolation of a S.cerevisiae mutant efficiently converting xylose into ethanol. 

Strain PUA6-1 containing the XYL1 and XYL2 genes chromosomally integrated (See Examples 3 and 
4) was able to grow on xylose as a sole carbon source whereas the untransformed PUA6-1 strain was 
completely negative on YNB xylose media. Doubling time of the transformant strain PK3 was 4 hours on 
YNB 1% xylose (for comparison, doubling time on YNB 1% glucose: 2 hours). Since ethanol production 
was inefficient in this strain when grown on xylose and no xylose growth was observed in the absence of 
respiration a mutant strain with improved capability in converting xylose to ethanol was selected as 
follows: 10 s PK3 cells were mutagenized with UV (254 nm) using conditions allowing 20% to 40% of the 
cells survival. The surviving cells were grown for approximately 30 generations in YNB 2% xylose liquid 
media. After plating on xylose solid media isolates were obtained which grow significantly faster than the 
parent strain PK3. One isolate was further propagated and used for selection of a mutant able to grow on 
YNB 2% xylose plates supplemented with 2 mg/l antimycin A in order to block respiratory metabolism. 
This procedure yielded a mutant (PK4) which was able to convert xylose significantly more efficiently to 
ethanol than the original transformant strain PK3. A typical xylose fermentation protocol is depicted in 
Fig. 4. The ethanol yield was approximately 40% of the initial xylose. This yield corresponds to 
approximately 80% of the theoretical maximum yield of ethanol from xylose conversion. 
EXAMPLE 7 

Expression of heterologous gen s in Pichiastipitis 

Following UV mutagenesis of Pichia stipitis strain CBS 5773 (DSM 5855) a trp5 mutant was isolated. 
The trp5 mutation was identified by examining indol accumulation according to Hagedorn and Ciriacy 
(Hagedorn and Ciriacy, 1989). 

For expression in Pichia stipitis plasmids were constructed which contain a replicon from Schwan- 
niomyces occidentalis (Sw^SlMhe TRPS-gene from S. cerevisiae (Dohmen et a!.. 1989) as a selective 
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marker and in addition a glucoamy!ase(GAM)-cellu!ase (celD) gene fusion under control of the 
glucoamylase promoter, in a first step the 3.8 kb EcoRl-Pvull-fragment from plasmid pBRSwARSGAM 
(Rg. 5, described in EP 89 107 780) was isolated "and inserted into the 2296 bp EcoRl-Pvull-fragment 
from pBR322 carrying the ampiciilin resistance gene and the bacterial origin of replication, resulting in 

5 plasmid pBRGAM (Fig. 5). In addition to pBR322 derived sequences this plasmid carries 3.6 kb derived 
from the 5' noncoding region of the glucoamylase gene from Schwanniomyces occidentalis and 
nucleotides 1 to 208 coding for the N-terminal part including the signal sequence of the glucoamylase. 
Subsequently, a 3.4 kb Pvull-fragment derived from plasmid pCT603 (Joliff et ah. 1986) containing the 
coding region of the celD-genes from Clostridium thermocellum with the exception of 120 bp 

to (corresponding to 40 amino acids) starting with the 5' terminus of the coding region was inserted into the 
Pvulll site of the pBRGAM resulting in pBRGCI (Rg. 6). For construction of a P. stipitis expression 
vector plasmid pCJD5-1 (EP 87 110 370.1) was cleaved with Bam HI/Pstl and iigated with a 6.5 kb 
BamHI-Pstl-fragment from pBRGd. The resulting plasmid was termed pMPGC1-2 (Fig. 7). The above 
diicribid'P. stipitis mutant trp5 was transformed with pMPGCl-2 and the transformants were identified 

is by their capability to grow on medium free of tryptophan (tryptophan prototrophy). Transformants were 
examined for cellulase activity using the congo red assay (Teather & Wood, 1982). The transformants 
constitutively produce active cellulase (endoglucanase D) of Clostridium thermocellum , which is secreted 
into the media, indicating, that the promoter and the signal sequence encoded by the glucoamylase gene 
may control expression of a heterologous gene and secretion of the gene product into the medium. 

20 Subsequently plasmid pMPGC1-2 was modified in order to substitute the glucoamylase promotor 

either by the S. cerevisiae ADH1 -promoter or the inventive 5' regions of the XYL1 or XYL2 gene, 
respectively. It oDuld be shown, that the expression under control of the XYL1 or XYL2 promoter region 
may be induced by xylulose, while being repressed by glucose. 
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Claims 



1. A DNA sequence, charact rized in that said DNA sequence comprises a structural gene coding for a 
xylose reductase and/or xylitol dehydrogenase, and is capable of expressing said polypeptide(s) in a 

5 microorganism. 

2. The DNA sequence according to claim 1 , characterized in that said DNA sequence is derived from a 
yeast, preferably from a yeast selected from a group consisting of the genera Schwanniomyces. 
Saccharomyces . Kluyveromyces , Pichia . Hansenula , Candida , Debaryomyces , Metschnikowia , and 

10 Pachysoien. 

3. The DNA sequence according to claim 2, characterized in that the yeast is Pichia stipitis, preferably 
Pichia stipitis CBS 5773 (DSM 5855). 

75 4- The DNA sequence according to any of claims 1 to 3, characterized in that said sequence comprises 
the structural gene encoding a xylose reductase having the following amino acid sequence: 
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10 

MPS IKLNSGY 

20 

DMPAVGFGCW 

30 

KVDVDTCSEQ 

40 

IYRAIKTGYR 

50 

LFDGAEDYAN 

60 

EKLVGAGVKK 

70 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHPDNVEK 

100 

ALNRTLS DLQ 

110 

VDYVDLF LIH 

120 

FPVTFKFVPL 

130 

EEKYPPGFYC 



140 

ss GKGDNFDYED 
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150 

VPILETWKAL 

„ „ 160 
EKLVKAGKIR 

170 

SIGVSNFPGA 

_ „ 180 
LLLDLLRGAT 

190 

IKPSVLQVEH 

200 

HPYLQQPRLI 

210 

EFAQSRGIAV 

220 

TAYSSFGPQS 

230 

FVELNQGRAL 

240 

NTSPLFENET 

250 

IKAIAAKHGK 

260 

SPAQVLLRWS 

so 270 

SQRGIAI IPK 
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280 

SMTVPRLLEN 
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290 

KDVNSPDLDE 

300 

QDPADIAKLD 

310 

INLRFNDP WD 



WDKlPlpv 



The DNA sequence according to claim 4, comprising the following nucleotide sequence: 
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-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAGTTTTCTTTTCATTTCTC 



-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 

AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 150 
TTGTTCG ACGGTGC CG AAG ATT ACG C C AAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 



470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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490 5io 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

590 

CACCCATACTTGCAACAACCAAGATTGATC 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

730 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

79C 810 
TCCCAAAGAGGCATTGCCATCATTCCAAAG 

830 

TCCAACACTGTCCCAAGATTGTTGGAAAAC 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
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910 930 
ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 99 0 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 12 30 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAATCCAAAAATAAAAAACGTCATTGTC 

1270 129 0 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

1630 1650 
TCTGGAGTGGAAACATCTCCAGTTGTCAAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 
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The DNA sequence according to any of claims 1 to 3. characterized in that said sequence comprises 
the structural gene encoding a xylitol dehydrogenase having the following amino acid sequence: 
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10 

MTANPS LVLN 

20 

KI DDISFETY 

30 

DAPEISEPTD 

40 

VLVQVKKTGI 



20 50 

CGSDIHFYAH 



10 



IS 



25 



30 



35 



60 

GRIGNFVLTK 

70 

PMVLGHESAG 

80 

TVVQVGKGVT 

90 

SLKVGDNVAI 

100 

EPGIPSRFSD 

110 

EYKSGHYNLC 

so 120 
PHMAFAATPN 
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SKEGEPNPPG 
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140 

TLC KYFKSPE 

150 

DFLVKLPDHV 

160 

SLELGALVEP 

170 

LSVGVHASKL 

180 

GSVAFG DYVA 

190 

VFGAGPVGLL 

200 

AAAVAKTFGA 

210 

KGVIVVDIFD 

220 

NKLKMAKDIG 

230 

AATHTFNSKT 

240 

GGSEELIKAF 

250 

GGNVPNVVLE 

260 

CTGAEPCIKL 
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270 

GVDAIAPG GR 

280 

FVQVGNAAGP 

290 

VSFPITVFAM 

300 

KELTLFGSFR 

310 

YGFNDYKTAV 

~ ^ 320 
GIFDTNYQNG 

330 

RENAPIDFEQ 

340 

LITHRYKFKD 
AIEAYDLVRA 



360 

GKGAVKCLID 



G P E * 

The DNA sequence according to claim 6, comprising the following nucleotide sequence: 
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-310 -290 
TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -110 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 
TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGT C ACT AC AA CTTGTGTC CTCAC ATGGC C 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 
GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 

710 730 
GAATTGATCAAGGCTTTCGGTGGTAACGTG 

750 

C C AAA C G T C G TTTT G G AAT GT A CTG GT G CT 

770 790 
GAACCTTGTATCAAGTTGGGTGTTGACGCC 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

93Q 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

1110 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 1150 
AGAAACGAAAATGATTATCAAATAGCTTTA 

1170 

TAGACCTTTATCGAAATTTATGTAAACTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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"50 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

13 10 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

137 0 . 1390 

GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
AAGCATTCAGCAAGCTTCCCCAGAAGTTGC 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 



29 



EP 0 450 430 A2 



8. The DNA sequence according to any of claims 1 to 7, characterized in that it is obtained by 
recombinant DNA technology from natural and/or cDNA and/or chemically synthesised DNA. 

9. A combination of DNA sequences, characterized in that said combination comprises a first DNA 
sequence according to any of claims 1 to 8 and one or more further DNA sequences capable of 
regulating the expression of a structual gene encoded by said DNA sequence in a host microorganism. 

10. A combination of DNA sequences according to claim 9, characterized in that said combination 
comprises all modifications of the DNA sequences retaining their capability to express a functional 
enzyme having xylose reductase or xylitol dehydrogenase activity. 

11. A combination of DNA sequences according to claim 9 or 10 t characterized in that said structural gene 
contains DNA sequences derived from the structural gene coding for xylose reductase or xylitol 
dehydrogenase which modify said protein product while retaining its functions in such a way that said 
protein product is expressed as a gene product having enzymatic activity. 

12. A combination of DNA sequences according to any of claims 8 to 11. characterized in that said DNA 
sequences capable of regulating the expression of said structural gene in a host microorganism are 
derived from said host microorganism. 

13. A combination according to claim 12, characterized in that said DNA sequences capable of regulating 
the expression are inducible promoters. 

14. A combination according to claims 13, characterized in that said DNA sequences capable of regulating 
the expression are selected from the following promoters: ADH1, ADH2, PDC, GAL1/10. 



15. A combination according to any of claims 12 to 14 characterized in that said DNA sequence capable of 
regulating the expression of said structural gene is a strong promoter, leading to over expression of the 
protein encoded by said structural gene. 

16. A vector, characterized in that said vector comprises DNA sequences according to any of claims 1 to 8 
or a combination of DNA sequences according to any of claims 9 to 15. 

17. A vector according to claim 16, characterized in that said vector is selected from the group comprising 
the plasmids pR1, pR2, pD1, pD2, pRD1, pXRa. pXRb pXDH. pXR, pXDH-HlS3, pXR-LEU2. 

18. A microorganism, characterized in that said microorganism is capable of expressing a xylose reductase 
and/or a xylitol dehydrogenase as a result of having received DNA sequences comprising the DNA 
sequences according to any of claims 1 to 8 or a combination of DNA sequences according to any of 
claims 9 to 15, coding for said xylose reductase and/or xylitol dehydrogenase by recombinant DNA 
technology. 

19. The microorganism according to claim 18, characterized in that said microorganism is selected from a 
group consisting of yeast of the genera Saccharomyces , Schizosaccharomyces , Schwanniomyces, 
Kluyveromyces , Pichia. Hansenula t Candida , Debaryomyces , Metschnikowia, Pachysolen or 
Paecilomyces or bacteria of the genus Zymomonas. 



20. The microorganism according to claim 1 9, characterized in that said microorganism is Saccharomyces 
cerevisiae. 

21. The microorganism according to claim 19, characterized in that said microorganism is Schizosac- 
charomyces pombe. 



22. The microorganism according to any of claims 18 to 21, characterized in that said DNA sequence or 
combination of DNA sequences is integrated into the genome of said microorganism. 
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23. The microorganism according to any of claims 18 to 22, characterized in that said microorganism is 
useful in biomass production, in food industry and fermentation processes. 

24. The microorganism according to claim 23, characterized in that said microorganism is useful for 
s fermentation of xylose into ethanol. 

25. A method for producing xylose reductase and/or xylitol dehydrogenase by cultivating a microorganism 
according to any of claims 18 to 22 under suitable conditions and recovering said enzyme(s) in a 
manner known per se. 

70 

26. The method according to claim 25, characterized in that said microorganism is selected for efficient 
fermentation of xylulose. 

27. The method according to claim 25 or 26, characterized in that said microorganism has received said 
is DNA sequences or said combination of DNA sequences by transformation using a vector, said vector 

being preferably a DNA fragment or a plasmid. 

28. The method according to claim 27, characterized in that said vector contains DNA, which is homolo- 
gous to DNA of said microorganism, leading to integration into the genome of said microorganism. 

20 

29. A xylose reductase produced according to the method of any of claims 25 to 28. 

30. A xylitol dehydrogenase produced according to the method of any of claims 25 to 28. 

25 31. An ethanol manufacturing process, characterized in that a microorganism according to any of claims 18 
to 24 is used. 

32. A process according to claim 31, characterized in that the fermentation process is adapted for the 
production of alcoholic beverages or single cell protein produced from substrates containing free 

30 xylose, preferably released by xylanase and/or xylosidase activity. 

33. A process for production of biomass, characterized in that a host microorganism according to any of 
claims 18 to 24 is used. 

35 34. A process for recycling of NADP* from NADPH using xylose reductase. 

35. A method for producing a desired protein in Pichia stipitis , comprising expression of the structural gene 
encoding said desired protein under control of the 5' regulating region of the XYL1 and/or XYL2 gene of 
Pichia stipitis and/or the ADH1 promoter from Saccharomyces cerevisiae and/or the glucoamylase 
40 promoter from Schwanniomyces occidental. 



45 



50 



55 
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Fig.2A (1) 

-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTG CAGACACACATAC ATACG AGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAGTTTTCTTTTCATTTCTC 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 
MPSIKLNSGY 
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Fig.2A (2) 



50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
DMPAVGFGCW 

70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 
KVDVDTCSEQ 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 
IYRAI KTGYR 

130 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 
LFDGAEDYAN 

170 

G AAAAGTTAGTTGGTG CCGGTGTC AAG AAG 
EKLVGAGVKK 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 
AIDEGIVKRE 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 
DLFLTSKLWN 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 
NYHHPDNVEK 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 
ALNRTLSDLQ 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 
VDYVDLFL IH 
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Fig.2A (3) 



350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 
FPVTFKFVPL 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 
EEKYPPGFYC 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 
GKGDNFDYED 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 
VPILETWKAL 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
EKLVKAGKIR 

490 510 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 
.SIGVSNFPGA 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 
LLLDLLRGAT 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 
IKPSVLQVEH 

590 

CACCCATACTTGCAACAACCAAGATTGATC 
HPYLQQPRLI 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 
EFAQSRGIAV 
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Fig.2A (4) 



650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 
TAYSSFGPQS 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 
FVELNQGRAL 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 
NTSPLFEN ET 

730 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 
IKAIAAKHGK 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 
SPAQVLLRWS 

790 810 
TCCCAAAGAGGCATTGCCATCATTCCAAAG 
SQRGIAIIPK 

830 

TCCAACACTGTCCCAAGATTGTTGGAAAAC 
S NTVPRLLEN 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 
KDVNSFDLDE 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
QDFADIAKLD 

910 930 
ATCAACTTGAGATTCAACGACCCATGGGAC 
INLRFNDPWD 
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Fig.2A (5) 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 
WDKIPIFV* 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 H70 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 1230 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAATCCAAAAATAAAAAACGTCATTGTC 

1270 1290 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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Fig.2A (6) 



1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

1630 1650 
TCTGGAGTGGAAACATCTCCAGTTGTCAAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATCC 
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• 2B(1 > -3X0 

TCTAGACCACCCTAAGTCGTCCCTATGTCG 



-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GG CTGCTG C ACATAACG CGGGG ACAATG AC 

-130 -110 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 
TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

M T A N 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 
PSLVLNKIDD 
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Fig.2B (2) 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
I S FETYDAPE 

90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 
ISEPTDVLVQ 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 
VKKTG ICGSD 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 
IHFYAHGRIG 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 
NFVLTKPMVL 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 
GH ESAGTVV Q 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 
VGKGVTSLKV 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 
GDNVAIEPGI 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 
PSRFSDEYKS 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 
GHYNLCPHMA 
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Fig.2B (3) 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 
FAATPNSKEG 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 
EPNPPGTLCK 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 
YFKSPEDFLV 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
KLPDHVSLEL, 

470 490 
GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 
GALVEPLSVG 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 
VHAS KLGSVA 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 
FGDYVAVFGA 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 
GPVG LLAAAV 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 
AKTFGAKGVI 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 
VVDI FDNKLK 
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Fig.2B (4) 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 
MAKDIGAATH 

690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 
TFNSKTGGSE 

710 730 
GAATTGATCAAGGCTTTCGGTGGTAACGTG 
ELIKAFGGNV 

750 

CCAAACGTCGTTTTGGAATGTACTGGTGCT 
PNVVLE CTGA 

770 790 
GAACCTTGTATCAAGTTGGGTGTTGACGCC 
EPCIK LGVDA 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 
IAPGGRFVQV 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
GNAAGPVSFP 

870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 
I TVFAMKELT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 
LFGSFRYGFN 

930 

GACTACAAGACTGCTGTTGGAATCTTTGAC 
DYKTAVGIFD 
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Fig.2B (5) 



950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 
TNYQNGRENA 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 
PIDFEQLITH 

1010 1030 
AG AT AC AAGTTCAAGG ACGCTATTGAAG CC 
RYKFKDAIEA 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 
YDLVRAGKGA 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 
VKCLIDGPE* 

1110 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 1150 
AGAAACGAAAATGATTATCAAATAGCTTTA 

1170 

TAGACCTTTATCGAAATTTATGTAAACTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 

1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 
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Fig.2B (6) 

1310 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

1370 1390 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
AAGCATTCAGCAAGCTTCCCCAGAAGTTGC 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 



CTATAGTGGATCC 
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Description 

The present invention relates to a DNA sequence, a combination of DNA sequences, a vector, a microorganism, 
a method for producing xylose reductase or xylose reductase and xylitol dehydrogenase; the invention further relates 

s to an ethanol manufacturing process, a process for production of biomass, a process for recycling of NADP* from 
NADPH and a method for producing a desired protein in Pichia stipitis. 

D-xylose is one of the most abundant carbohydrates occurring in plant biomass and wood, in the process of cel- 
lulose production, it is formed as a waste product from hydrolysis of xylan, which is the major compound of hemicel- 
lulose. To optimize the use of renewable carbon sources, it is desirable to convert xylose into ethanol or biomass. 

io There are several yeast species, such as Candida (Gong et al., 1981, Jeffries, 1983), Debaryomvces , Hansenula , 
Kluyveromyces , Metschnikowia . Pachysolen , Paecilomyces (Wu et al., 1986) and Pichia (Maleszka and Schneider 
1982), which are able to utilize pentoses, including D-xylose, and D-ribose, however, only aerobically. 

In general, pentoses utilized by yeasts (e.g. Pichia stipitis ) must be isomerized to pentuloses in order to be phos- 
phorylated. This isomerization occurs via a NAD(P)H linked reduction (reductase) to pentitols followed by NAO-linked 

15 oxidation (dehydrogenase) of the pentitols to the corresponding D-pentuloses (Barnett, 1976). The yeast mainly used 
in bioethanol production, S. cerevisiae , can utilize xylulose, however, this yeast is not able to ferment pentoses (Jeffries, 
1988). It cannot be excluded, that S. cerevisiae also contains genes, coding for pentose fermenting proteins which 
however are not expressed. 

Pentose fermentation by S. cerevisiae may be possible by providing a xylose utilising pathway from a xylose 
20 metabolizing organism. However although many attempts have been undertaken to express bacterial xylose iscmerase 
genes in S. cerevisiae , no xylose fermentation could be obtained probably due to inefficient expression of the foreign 
gene (Sarthy et al., 1987, Amore et al., 1989, Chan et al., 1986 & 1989). 

Therefore it is a primary object of the present invention to provide genes of the enzymes involved in xylose deg- 
radation in order to be able to manipulate these genes, for example to combine these sequences with suitable regulating 
25 sequences. 

This object has been solved by a DNA sequence comprising a structural gene coding for xylose reductase or 
xylose reductase and xylitol dehydrogenase and being capable of expressing said polype ptide(s) in a microorganism. 

Further objects of the present invention will become apparent by the following detailed description of the invention, 
the examples and figures. 

30 Throughout this application various publications are referenced by the first author within parenthesis. 

Full citations of these references may be found at the end of the specification as an annex. The disclosures of 
these publications in their entireties are hereby incorporated by reference into this application in order to more fully 
describe the state of the art as known to those skilled therein as of the date of the invention described and claimed 
herein. 

35 The DNA sequences according to the present invention preferably are derived from a yeast. Preferred yeast strains 

are selected from the genera Schwann iomyces , Saccharomyces , Kluyveromyces , Pichia , Hansenula, Candida , De- 
baryomvces , Metschnikowia , Pachysolen and Paecilomyces . All of these yeast genera are known to be able to convert 
xylose into ethanol using xylose reductase and xylitol dehydrogenase. 

A preferred genus used as a source for the DNA sequence according to the present invention is the yeast Pichia 

40 This genus comprises several species, any of which could be applied for performing the present invention. However 
the preferred species is Pichia stipitis . The present inventors used Pichia stipitis CBS5773 for isolation of the DNA 
sequences comprising a structural gene coding for xylose reductase and/or xylitol dehydrogenase. Pichia stipitis 
CBS5773 was redeposited under the Budapest Treaty on March 21, 1990 (DSM 5855). 

The present inventors succeeded to isolate DNA molecules containing a sequence comprising the structural gene 

45 encoding a xylose reductase and a xylitol dehydrogenase respectively. By way of the DNA sequence, which was de- 
termined according to standard procedures, the amino acid sequence of both these proteins could be determined for 
the first time. The complete amino acid sequences as well as the nucleotides sequences of both these proteins are 
shown in Figures 2A and 2B. As is known to everybody skilled in the art the proteins having the amino acid sequences 
as shown in Figures 2A and 2B can be encoded not only by the DNA sequences as found in Pichia stipitis CBS5773, 

so but also by using alternative codons provided by the degeneracy of the genetic code. The invention thus is not limited 
to the DNA sequence as shown in Figure 2, but also comprises any modification yielding the same amino acid se- 
quences. 

The DNA sequences according to the present invention may not only be obtained by applying the methods shown 
below, i.e., by isolating cDNA clones, which further on are used to screen a genomic library, but also may be obtained 
55 by other methods of recombinant DNA technology from either natural DNA or cDNA or chemically synthesized DNA 
or by a combination of two or more of these DNAs. For example, it may be attempted to combine a chemically synthe- 
sized 5' region with a cDNA coding for the 3* region or any other combination of the three DNA sources mentioned above. 

According to the present invention there are also provided combinations of DNA sequences, which comprise a 
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DNA sequence as discussed above, i.e., a sequence comprising a structural gene coding for a xylose reductase or 
xylose reductase and xylitol dehydrogenase, and in addition one or more DNA sequences capable of regulating the 
expression of the structural genes mentioned above in a presumptive host microorganism. DNA sequences capable 
of regulating the expression of structural genes are well known to those skilled in the art. For example, the DNA se- 
5 quences discussed above may be combined with promoters, which are connected with the structural genes in order 
to provide efficient expression. Further DNA sequences capable of regulating the expression may comprise enhancers, 
termination sequences and potyadenylation signals. Examples for the best known kind of regulating sequences, are 
shown by the following examples. 

In order to express the DNA sequences and/or the combination of DNA sequences according to the present in- 
fo vention efficiently, small modifications of the DNA sequences may be performed, as long as their capability to express 
a functional enzyme having the desired xylose reductase or xylitol dehydrogenase activity is retained. These modifi- 
cations may include either variations of the genetic code as discussed above or furthermore small substitutions of the 
amino acid sequence, as well as deletions and/or insertions, which do not have any detrimental impact on the respective 
enzyme activity. 

1$ In a preferred embodiment the DNA sequence, capable of regulating the expression of the structural gene, is 

derived from an endogenous gene of the microorganism, in which expression of the DNA sequence is intended. Since, 
as will be shown below in more detail, Saccharomyces cerevisiae is one of the preferred microorganism to be used in 
the present invention, there are a multitude of possible regulating sequences known. Some of these well-known se- 
quences have been used to construct expression vectors, as will be shown below in the examples. In the most preferred 

20 embodiment the combination of DNA sequences comprises inducible promoters. In this case the expression of xylose 
reductase and xylitol dehydrogenase can be prevented, as long as desired; expression may be started upon addition 
of a suitable inducer. 

In the most preferred embodiments of the present invention the following Saccharomyces cerevisiae promoters 
are used to regulate the expression of the genes encoding xylose reductase or xylose reductase and xylitol dehydro- 
25 genase: ADH1 , ADH2 , PDC, GAL1/10. 

Depending on the choice of the respective promoter it may be possible to obtain expression levels exceeding that 
of natural expression of both proteins in their original host organism. 

The DNA sequences as well as the combinations of the DNA sequences according to the present invention may 
be introduced in vector molecules. These molecules may be plasmids : which are suitable for replication in the desired 
30 host microorganism and thus should contain a functional origin of replication. Alternatively, it is also possible, to use 
linear DNA fragments carrying the DNA sequence or combination of DNA sequences according to the present invention 
or to use circular DNA molecules being devoid of a functional origin of replication. In this case the vector, which is not 
capable of replication, will be inserted by either homologous or nonhomologous recombination into the host chromo- 
some. 

35 Subject of the present invention are further microorganisms, which have received DNA sequences comprising the 

inventive DNA sequences or combinations of DNA sequences coding for xylose reductase and xylitol dehydrogenase 
by recombinant DNA technology. 

Preferred microorganisms are selected from a group consisting of yeast of the genera Saccharomyces, Schizosac- 
charomvces . Schwann iomyces , Kluyveromyces, Pichia , Hansenula , Candida , Debarvomyces , Metschnikowia , Pach- 
ysolen or Paeci Iomyces or bacteria of the genus Zymomonas . 

From these organisms the most preferred microorganisms are Saccharomyces cerevisiae and Schizosaccharo- 
myces pombe and Zymomonas . 

One of the possible applications of the genetically altered yeast strains described above is the production of bio- 
mass. Since the yeast strains having acquired the ability of expressing xylose reductase or xylose reductase and xylitol 
45 dehydrogenase are maintaining good fermentation abilities, biomass can be produced most efficiently by use of these 
inventive yeast strains. The methods for producing biomass are the usual ones, which are known to everybody skilled 
in the art. The genetically manipulated yeast strains provided in compliance with this invention are also suitable for the 
production of ethanol. The preferred organisms for use in the production of ethanol by fermentation are the yeasts 
Saccharomyces cerevisiae and/or Schizosaccharomyces pombe and/or the bacterium Zymomonas . 
so The preferred carbohydrate in the ethanol production is xylose. Thus, strains of Saccharomyces cerevisiae and/ 

or Schizosaccharomyces pombe and/or Zymomonas being able to ferment xylose are highly advantageous in the 
production of ethanol. The production of potable spirit or industrial ethanol by use of a genetically manipulated yeast 
strain according to the present invention can be carried out in a manner known per se . The inventive yeast strains 
have the ability to ferment concentrated carbohydrate solutions, exhibit high ethanol tolerance and have the ability of 
55 producing elevated concentrations of ethanol; they have a high cell viability for repeated recycling and exhibit remark- 
able pH-and temperature tolerance. In the process of xylose production xylose is formed as a waste product from 
hydrolysis of xylan, which is the major compound of hemicellulose. Hence it is of great advantage to use xylose for the 
production of ethanol and/or biomass. The invention is further suitable for the production and isolation of the NAD(P) 
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H linked xylose reductase. Due to the reduction reaction this enzyme is suitable for the delivering or recycling (from 
NADPH to an NADP*) of the corresponding coenzyme especially in bioreactors, for example for the production of 
amino acids. 

A further subject of the present invention is a method for producing the xylose reductase or xylose reductase and 
5 xylitol dehydrogenase by cultivating a microorganism according to the present invention under suitable conditions and 
recovering said enzyme or both of them in a manner known per se . The method thus includes the expression of a DNA 
sequence or a combination of DNA sequences according to the present invention in a suitable microorganism, culti- 
vating said microorganism under appropriate conditions and isolating the enzyme. 

It could be shown, that the level of expression of desired proteins in the inventive microorganisms is enhanced, if 
io the microorganism has been selected for efficient fermentation of xylulose. Thus, is is preferred, to perform the method 
for reproducing one or both of the proteins using microorganisms, which have been selected accordingly. 

Since the present invention provides the cloned genes and the corresponding sequences, the gene products can 
be overproduced in other organisms, e.g. in yeasts of the genera Saccharomvces, Schizosaccharomyces , Schwanni- 
omvces , Kluvveromvces, Pichia , Hansenula , Candida , Debarvomvces , Metschnikowia , Pachvsolen , Paecilomvces or 
'5 bacteria of the genus Zymomonas . The techniques employed for obtaining expression of the XYL1 (xylose reductase) 
or XYL1 and XYL2 (xylitol dehydrogenase) gene and the isolation of the active gene product are the usual ones such 
as promoter-fusion, transformation, integration and selection, and methods of protein isolation, known by the man 
skilled in the art. 

Generally, said microorganisms have received the DNA sequence or combination of DNA sequences via transfor- 
20 mation procedures. For each of the possible microorganisms, i.e. the different yeast genera and bacteria of the genus 
Zymomonas , there are transformation procedures known. The transformation is preferably carried out using a vector, 
which may be either a linear or circular DNA molecule; in addition, the method can be performed using autonomously 
replicating or integrative molecules as well In the case, that the molecule is supposed to integrate into the genome of 
the respective host, it is preferred, to use a vector containing DNA, which is homologous to the DNA of said intended 
2$ host microorganism. This measure facilitates homologous recombination. 

Further subjects of the present invention are the enzymes produced according to the above described method. 
The microorganisms according to the present invention may be used in ethanol manufacturing processes. Since 
xylose is a readily available source, which normally is considered to be waste, the ethanol manufacturing process 
according to the present invention provides a possibility for ethanol production of high economical and ecological 
30 interest. 

The ethanol manufacturing process may be adapted for the production of alcoholic beverages or single cell protein 
from substrates containing free xylose, which is preferably released by xylanase and/or xylosidase activity from xylan. 

According to the present invention there is further provided a method for the production of a desired protein in 
Pichi a stipitis . According to this method a structural gene coding for a desired protein is expressed under control of 
35 the 5' regulating region of the XYL1 and/or XYL2 gene from Pichia stipitis and/or the ADH1 promoter of S. cerevisiae 
and/or the glucoamylase promoter from Schwanniomyces occidentalis. Out of the promoters mentioned before use of 
the 5' regulating regions of the XYU or XYL2 genes is preferred, because these promoters may be induced by adding 
xylose. Pichia stipitis , when used as a host organism, exhibits the great advantage of having an efficient secretion 
system. This facilitates an efficient expression not only of proteins, which stay inside the cell, but also of proteins, which 
to are continuously secreted into the medium. A further advantage of the Pichia stipitis expression system is the possibility 
of using xylose as a substrate. Xylose is a rather inexpensive, readily available nutrient. 

The invention will be discussed in detail by way of the following figures and examples. 
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BRIEF DESCRIPTION OF THE FIGURES: 
Fig. 1 

A: restriction map of the DNA fragment encoding the xylose reductase gene (XYL1) 

50 E: EcoRl, H: Hindlll, B: BamHI, N: Ncol, 

P: Pvull, Ps: Pst1 

B: restriction map of the DNA fragment encoding the xylitol dehydrogenase gene (XYL2 ) 

Ba: BamHL B: BoJII, E: EcoRI, X: Xbal, S: Sail 
Fig. 2 
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A) Nucleotide sequence of the XYL1 structural gene including its 5'- and 3'-flanking sequences and the cor- 
responding amino acid sequence. 

B) Nucleotide sequence of the XYL2 structural gene including its 5'- and 3-flanking sequences and the cor- 
responding amino acid sequence. 

Fig. 3 S. cerevisiae and S. pombe expression vector. Plasmid pRD1 contains both the xylose reductase gene and 
xylitol dehydrogenase gene under control of their original promoters. 

Fig. 4 Fermentation curve of PK4 grown in YNB, 2% xylose medium. The culture was inoculated with 10 s cells/ml 
from a xylose grown preculture. The figure shows xylose consumption and conversion into ethanol with a theoretical 
maximum yield. 

Fig* 5 (1 ,2) Construction scheme for constructing the vector pBRPGAM. For constructing this vector, the 38 kb 
EcoRI-Pyull-fragment from pBRSwARSGAM containing the functional GAM promoter and base pairs 1 to 208 of 
the coding GAM sequence was ligated to the small EcoRI-Pvull-fraqment of pBR322. 

Fig. 6(1,2) Construction scheme for constructing the vector pBRGCI . For constructing this vector, the 3.4 kbPvull- 
fragment of pCT603 containing the structural gene for xylose starting with nucleotide + 122 was inserted into the 
Pvyll site of vector pBRPGAM. 

Fig. 7 (1,2) Construction scheme for constructing the vector pMPGC1-2. The 6.5 kb Bam HI-Pstl-fragment of 
pBRGCI containing the cellulase gene under control of the GAM promoter was ligated with the large BamHI-Pstl- 
fragment of pCJD5-1. 

EXAMPLES 

Materials and Methods 

I. Microorganisms and cultivation 

Yeast strains: 

1 . S. cerevisiae: 

a) XJB3-1B (MAT a, met6, gal2) was obtained from the Yeast Genetic Stock Center (see Catalogue of the 
Yeast Genetic Stock Center, 6. edition, 1987). 

b) GRF18 (MAT a, leu2-3. Ieu2-112 ( his3-11, his3-15) was obtained from G.R. Fink (DSM 3796). 

c) AH22 (MATa, can1 v his4-519, Ieu2-3, Ieu2-112) was obtained from A. Hinnen (DSM 3820). 

2. Schizosaccharomyces pombe (leu1-32, his5-303) (DSM 3796). 

3. R stipitis CBS5773 (DSM 5855) was obtained from Centraalbureau voor Schimmelcultures, Yeast Division, 
Delft, The Netherlands. 

Yeast strains were grown at 30°C in YP medium (1% yeast extract, 2% bacto pepton) or in 0.67% Difco yeast 
nitrogen base (YNB) without amino acids, optionally supplemented with appropriate amino acids. Media were supplied 
with either 2% xylose or 2% glucose. The yeasts were transformed according to Dohmen et al. (1989). 

E. coli strains: 

1. DH5aF' (supplied by BRL company, Eggenstein, FRG) 

2. HB101 (DSM 3788) (Bolivar et al., 1977). 

E. coli strains were grown at 37°C in rich medium (IB-medium, Maniatis et al., 1982). The medium was supple- 
mented with penicillin G (100 ng/ml) when selecting for transformants. E. coli transformation was carried out as de- 
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scribed by Maniatis ( 1 982). 

II. Purification of the XR and XDH proteins from P. stipitis 

5 Cells were grown under induced conditions to exponential growth phase. To prepare cell-free extracts cells were 

harvested by centrifugation and were broken with glass beads in a Braun homogenizer using 0.1 M Tris-HCI buffer 
(pH 7.0). The supernatant obtained following 1 h centrifugation of the crude extract (150000 x g) was loaded on an 
affinity chromatography column (Affi-Gel Blue, 60x50 mm) preequilibrated with 5 mM NaP0 4 buffer (pH 6.8) and eluted 
with 1.5 mM NAD. The fractions containing XR and XDH activity were pooled and dialysed against 20 mM Tris-HCI 

10 (pH 7.5). The dialysate was subsequently applied to a DEAE-Sephacel anion exchange column preequilibrated with 
20 mM Tris-HCI (pH7.5). Proteins were eluted with a linear gradient (20-250 mM Tris-HCI, pH 7.5). Fractions containing 
the highest activity were pooled, concentrated and loaded on a SDS-PAA-geL After running the gel was stained with 
0.1 M KCI and the XR- and XDH-proteinbands were cut out, both proteins were separately eluted from the polyacry- 
Jamide gel by dialysis using 20 mM NaP0 4 (pH 8.0) : 0.1% SDS; subsequently the dialysate was concentrated. Ail 

is- buffers contained 0.2 mM DTT (Dithiothreitol) and 0.4 mM PMSF (Phenylmethansulfonylfluoride). 

III. Preparation of antlsera 

Mice were given intraperitoneal injections of 2-5 jig protein in Freund complete adjuvant. Two weeks later the same 
20 amount of protein in Freund incomplete adjuvant was injected; a third injection was administered another 2 weeks later 
omitting Freund adjuvant. Antiserum was harvested six weeks after the first injection. 

IV. Immunoscreening 

25 Antisera raised in mice against purified P. stipitis xylose reductase (XR) and xylitol dehydrogenase (XDH) protein, 

respectively, were used for screening the cDNA library following the procedure of Huynh et al. (1985). The antisera 
were diluted 10.000-fold. Bound antibodies were visualized using an alkaline phosphatase-conjugated goat anti-mouse 
immunoglobulin antibody, followed by a colour development reaction with the phosphatase substrate 5-bromo-4-chloro- 
3-indolyl phosphate (BCIP) in combination with nitro blue tetrazolium (NBT). 
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V. Isolation of RNA 



All procedures were carried out at 0 to 4°C, if not indicated otherwise. All solutions and materials were sterilized 
if possible. R stipitis cells were grown to midexponentia! phase in the presence of xylose. Yeast cells were harvested 

35 by centrifugation, washed twice with buffer 1 (20 mM NaCI, 10 mM MgC^, 100 mM Tris-HCI, ph 7.6) and suspended 
in the same buffer (1 .25 mi/g cells). 1/10 volume phenol, 200 jig/ml heparin, 100 ng/ml cycloheximid and 0.4% SDS 
were added. Disruption of the cells was carried out by shaking with glass beads (0.45 - 0.5 mm) in a ratio of glass 
beads to suspension of 1:1 (v/v) in a Braun homogenizer (Braun, Melsungen). Two volumes of buffer 2 (buffer 1 con- 
taining 100 u,g/ml heparin, 50 ng/ml cycloheximid, 2% SDS) were added to the homogenate, cell debris were removed 

40 by centrifugation (10000 x g, 10 min). The solution was extracted three to five times with phenol/chloroform (1:1), once 
with chloroform/ isoamylalcohol (24:1). The nucleic acid was precipitated by incubating the aqueous phase with 2,5 
volume of ethanol in the presence of 0.2 M NaCI over night at -20°C. The precipitate was solubilized in buffer 3 (0.1 
M NaCI, 1 mM EDTA, 10 mM Tris-HCI, pH 7.5); SDS and LiCI were added to a final concentration of 0.1% and 4 M, 
respectively. The RNA was precipitated over night at +4'C. The pellet was washed twice with 70% ethanol and sus- 

45 pended in sterilized H^ before use. RNA was stored at -70 0 C as an ethanol precipitate. 

VI. Enzyme assays 

Activities of xylose reductase (EC. 1.1.1.21) and xylitol dehydrogenase (EC. 1.1.1.9) were measured as described 
50 by Bruinenberg et al. (1983). Protein was determined with the micro biuret method according to Zamenhoff (1957) 
using bovine serum albumin as standard. 

VII. Gelelectrophoresis 

55 SDS gelelectrophoresis was carried out in 10% PAA according to Laemmli (1 970). 
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VIII. Immunoblotting 

Detection of antigenic proteins was carried out as described by Towbin et al. (1979) using the antisera obtained 
from mice. The proteins were transferred to a polyvinylidene difluoride microporous membran (Millipore, Immobilon 
s PVDF) and were visualized by a phosphatase-coupled colour reaction (Blake et al. . 1 984). Alkaline phosphatase con- 
jugated to goat anti-mouse IgG was obtained from Jackson Immunoresearch Lab. (Avondale, USA). 

IX. DNA-sequence analysis 

io XYL1 and XYL2 genomic DNA as well as the respective cDNAs were subcloned in pT7T3-1 3U (Pharmacia). Frag- 

ments obtained by partial digestion using Exonuclease III (Henikoff, 1984) were analysed and sequencing was carried 
out by the dtdeoxy method of Sanger et al. (1977) using the T7-Sequencing™kit (Pharmacia). Both strands were 
completely determined by obtaining overlapping sequences at every junction. 

is X. Construction of a R stlpitis CBS 5773 (DSM 5855) cDNA library 

Total RNA was extracted according to the method described above. Poly (A) + -RNA was prepared by chromatog- 
raphy on an oligo(dT)-cellulose column using essentially the method described by Maniatis et al. (1 982). A cDNA library 
in \gt11 was prepared by the method of Gubler and Hoffman (1983) using a cDNA synthesis kit (Pharmacia) and jn 
20 vitro packaging of the recombinant Xgt11-DNA according to Hohn and Murray (1974) using the in vitro packaging kit 
supplied by Boehringer, Mannheim (FRG). 

XI. Preparation of crude extracts 

25 Cells were grown to late exponential growth phase and washed twice in buffer (10 mM potassium phosphate, pH 

7.0, 1 mM EDTA, 5 mM p-mercapto ethanol). Cells were broken in an Braun homogenizer with an equal volume of 
glass beads. The supernatant resulting from 5 min centrifugation at 10000 g was used in enzyme assays. Extracts for 
Western blot analysis were boiled in 1% SDS t 5% p-mercapto ethanol, 10 mM potassium phosphate pH 7.0 and 10% 
glycerol. 

30 

EXAMPLE 1: 

Isolation of the xylose reductase (XYL1 ) and xylitol dehydrogenase (XYL2) genes. 

35 A Xgt1 1 cDNA library constructed from poly (A) + -RNA of P. stipitis was screened with mouse polyclonal antibodies 

raised against the purified xylose reductase (XR) and xylitol dehydrogenase (XDH) proteins, respectively. Among 
110.000 recombinant clones of the amplified cDNA library containing about 55.000 primary clones, seven identical 
XYL1 clones and three identical XYL2 clones were identified and purified. The analysis of the insert size revealed that 
the XYL1 clones contain two Eco R1 fragments (0.6 kb and 0.4 kb), whereas the XYL2 clones contain a single 0.55 kb 

40 Eco R1 fragment. The respective EcoRI fragments of the Xgt11 clones were subcloned into the single Eco RI site of 
plasm id pT7T3-18U (Pharmacia) resulting inplasmids pXRa (containing the 0.4 kb Eco RI fragment of the XYL1 clone), 
pXRb (containing the 0.6 kb EcoRI fragment of the XYL1 clone) and pXDH (containing the 0.55 kb Eco RI fragment 
of the XYL2 clone). 

These plasmids were used as a radioactive probe to screen a P. stipitis genomic library, which was constructed 
*s by ligation of partial Sau3A digested P. stipitis DNA into the single Bam HI site of the S. cerevisiae - E. coli shuttle 
vector YEp1 3 (Broach et al., 1 979) resulting in about 60.000 independent clones after transformation of E. coli HB101 . 

Two plasmids, namely pR1 and pD1 could be isolated and were used for transformation of S. cerevisiae GRF18. 
XR activity could be detected in the crude extracts of the transformants containing pR1 , whereas transformants carrying 
pD1 yielded crude extracts exhibiting XDH activity. In a mitotic stability test (Beggs 1978) the LEU2 marker and the 
so XR or XDH gene cosegregated, indicating that pR1 and pD! harbour the functional XYL1 (xylose reductase) and XYL2 
(xylitol dehydrogenase) gene, respectively. 

The plasmids pR1 and pD1 were subjected to restriction enzyme analysis yielding the map of restriction sites of 
the XYL1 (Fig. 1A) and XYL2 (Fig. 1B) genes, respectively. 

Further subcloning experiments revealed that the XYU gene is encoded within a 2.04 kb Bam HI genomic fragment. 
5 $ One of the Bam HI sites is not present in the original plasmid pR1 . it must have been generated during subcloning. The 
XYL2 gene is encoded within a 1 .95 kb BamHI-Xbal fragment. The 2.04 Bam HI fragment and the 1.95 kb BamHI -Xbal 
fragment were subcloned into the multiple cloning site of pT7T3-18U resulting in pR2 and pD2, respectively, and sub- 
jected to DNA sequence analysis. The DNA sequence of the structural gene and of the 5' and 3' non-coding region of 
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the XYL1 and the XYL2 gene is depicted in Fig. 2A and Fig. 2B, respectively. 

The DNA sequence of the XYL1 gene contains an open reading frames of 954 bp (31 8 amino acids) whereas that 
of the XYL2 gene comprises an ORF of 1089 bp (363 amino acids). 

The amino acids deduced from the open reading frames are shown in Fig. 1A and Fig. 2B. The sequences corre- 
5 spond to an XR polypeptide and an XDH polypeptide with a calculated molecuar weight of 35922 and 38526 D, re- 
spectively. 

EXAMPLE 2 

10 Expression of both the xylose reductase and xylitol dehydrogenase gene in S. cer evtsiae. 

Saccharomyces cerevisiae was contransformed with pRl and pD1 . The hignest measurable activities of XR and 
XDH in S. cerevisiae transformed accordingly correspond to 50% of the activities of both enzymes measurable in R 
stipitis crude extracts. In S. cerevisiae the genes were expressed in YNB medium containing 2% glucose as a sole 
'5 carbon source, whereas in P. stipitis expression of both genes is repressed by glucose and induced by xylose. Taking 
into account the copy number of 10 of YEp13 in S. cerevisiae and assuming a gene dosage dependent expression 
one can conclude that the Pichia promoters are 20 times less efficient in S. cerevisiae than in P. stipitis. 

Furthermore, a plasmid harbouring both the XYL1 and XYL2 gene including their original Pichia promoters was 
constructed (Fig. 3). This plasmid pRD1 was used to transform strain GRF18 by selection on leucine resulting in the 
20 transformant PK1 . However, expression was not improved compared to cotrans'ormation with separate plasmids. 

EXAMPLE 3 

Construction of an integrative vector containing the XYL2 gene under control of different promoters 

25 

Different expression vectors using different promoters for integration and gene expression in S. cerevisiae were 
constructed. For example the XYL2 gene was fused to the ADH1 promoter fol'cwed by homologous integration into 
the HIS 3 locus of S. cerevisiae . The strategy employed was as follows: The 1.5 kb Xba l/ Eco R1 fragment containing 
the xylitol dehydrogenase gene XYL2 was inserted into the multiple cloning site of pT7T3-18U (Pharmacia) resulting 

30 in plasmid pXDH. To eliminate the promoter region of the XYL2 gene this plasmid was linearized with Xba l (restriction 
site 318 bp upstream of the initiator ATG codon) and with Pstl to protect the 3 1 end of the plasmid DNA. The linear 
plasmid was treated with exonuclease III and subsequently with S1 nuclease to remove the DNA between the Xba l 
site and the XYL2 structural gene. The deleted DNA molecules were recirculated, cloned in E. coli and the extent of 
deletion was determined by dideoxy sequencing. In one of the modified pXDH piasmids the 5* untranslated region and 

35 the four N-terminal amino acids were deleted. However, a new inframe ATG initiation codon was created due to the 
Sphl site from the multiple cloning site. A Bam HI linker was inserted into the Hind i 1 1 site of the multiple cloning site. 
Subsequently, a 1.5 kb Bam HI fragment carrying the XYL2 gene could be subcfened into vector pT7T3-l8U resulting 
in additional restriction sites in front of the ATG initiation codon. The newly created 5' region is as follows: ATG CCT 
TGG TGT... (deletion of original amino acid 2,3 and 4). 

40 To complete the 3* untranslated region of the XYL2 gene a 440 bp Eco RI fragment, was inserted into the single 

Eco RI site of the 1 .5 kb fragment subcloned in pT7T3-18U. This 440 bp fragment was obtained by subcloning the 440 
bp EcoRI -Bam HI fragments (see Fig. 1B) into another pT7T3-18U t removing fre Bam HI site by cutting with Bam HI 
and subsequent filling-in with Klenow polymerase. The 3' untranslated region cculd thus be isolated as 440 bp Eco RI 
fragment. In the single Bam HI site arranged near the 5' terminus of the XYL2 gene, which is provided by the porylinker 

4 $ region, the 1 .8 kb Bam HI fragment harbouring the S. cerevisiae His3 gene derived from plasmid YEp6 (Struhl et al. 
1979) was inserted. To remove one of the two Bam HI sites the resultant plasmid was cut with Sail and Xho l and 
subsequently recircularized. The resulting plasmid pXDH-HIS3 contains one suitable Bam HI site in front of the ATG 
initiation codon in which the 1.5 kb Bam HI fragment, containing the ADH1 -promoter (Ammerer, 1983) of S. cerevisiae 
can be inserted. 

50 Since this plasmid does not contain any autonomous replicating sequence for S. cerevisiae this plasmid can be 

used for homologous integration (Orr-Weaver et al. 1981 ) into the HIS3 locus cf any S. cerevisiae strain. 

In our integration experiments we used a mutagenized XJB3-1B strain called =UA6-1 , which was isolated according 
the protocol of Porep, (1987) and Ciriacy, (1986). The resulting integrant PK2 is expressing the XYL2 gene under 
control of the ADH1 promoter leading to an active gene product. 

55 
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EXAMPLE 4 

Construction of S. cerevisiae and S. pombe Integrants expressing both the XYL1 and XYL2 gene. 

s To eliminate the promoter region of the XYL1 gene plasmid pR2 containing the XYL1 gene on a 2,04 kb Bam HI 

fragment was linearized with Xba l ( restriction site 362 bp upstream of the translation initiation ATG codon) and cleaved 
with Sph l to protect the 3* end of the plasmid DNA. The linear plasmid was treated with exonuclease III and subsequently 
with S1 nuclease to remove the DNA between the Xba l site and the XYH structural gene. The deleted DNA molecules 
were ^circularized, cloned in E. coll and the extent of deletion was determined by dideoxy sequencing. In one of the 

10 modified pR2 plasmids the 5' untranslated region was exactly deleted. 

The structural gene was subcloned as a Hin dlll -Bam HI fragment into the corresponding sites of Ylp366 (Myers et 
aL 1986). In addition the ADH1 promoter was subcloned into the Hindlll site by blunt end ligation resulting in plasmid 
pXR-LEU2. Since this plasmid does not contain any autonomous replicating sequence for S. cerevisiae this plasmid 
can be used for homologous integration (Orr-Weaver et al. 1931) into the LEU2 locus of any S. cerevisiae strain, e.g. 

'5 strain PK2. The resulting integrant PK3 is expressing both the XYL1 and XYL2 genes under control of the ADH1 
promoter leading to active gene products. For expression studies in Schizosaccharomyces, S. pombe was transformed 
with both plasmids pXDH-HIS3 and pXR-LEU2 selecting for histidine and leucine. After extensive screening of the 
transformants for growth on xylose one transfonmant called AS1 could be isolated expressing both the XYL1 and XYL2 
gene under control of the ADH1 promoters. 

20 in the same manner other S. cerevisiae promoters, e.g. pyruvate decarboxylase (PDC ) promoter (Kellermann & 

Hollenberg, 1988), alcoholdehydrogenase 2 (ADH2) promoter (Russell et al., 1983) or the galactokinase (GAL1/10 ) 
promoter from plasmid pBM272, which is derived from plasmid pBM150 (Johnston and Davis, 1984) by introducing a 
Hin di II site immediately following the Bam HI site, led to expression of active XYL1 and XYL2 gene product in S. cer- 
evisiae . 

25 in another set of experiments two suitable restriction sites Bam HI (position -9) and Sail (position - 15) were intro- 

duced just in front of the XYL1 and XYL2 genes. 



XYL1 : 5 ' attCttttCta GTCGAC GGATCCAAGATGCCTTCTATT 
. . .TAA terminator 3' 



35 XYL2 : 5 ' CCCCtaatac tGTCGAC GGATCCAAGATGACTGCTAAC 

. . .TAA terminators' 



*o These modifiations were introduced by site directed mutagenesis of the 5' region using the site directed mutagen- 

esis kit supplied by Amersham according to the instructions of the manufacturer. These restriction sites offer the pos- 
sibility to fuse any promoter just in front of the ATG initiation codon. In addition the gene under control of a desired 
promoter can be isolated as a well defined fragment for insertion into sequences suitable for homologous integration. 
For industrial or commercial purposes it is desirable to construct stable production strains of S. cerevisiae and/or 

45 S. pombe . Therefore both genes under control of the constitutive ADH1 promoter were integrated without any bacterial 
sequence into the chromosome of S. cerevisiae strain PUA6-1 via homologous integration (Orr-Weaver et aL1981). 
Integration into the HO homothallism gene (Russel et al. 1986), ARS-sequence (Stinchcomb et al., 1978) or into the 
ADH4 gene (Paquin et al., 1986) by cotransformation with pJW6 (Fogel and Welch, 1982) is preferred resulting in 
strains PK3(HO), PK3(ARS) and PK3(ADH4). In the case of S. pombe the integration mainly occurs via illegitimate 

so recombination. Hence only a few of the S. pombe integrants exhibit XR and XDH activities and have the same fermen- 
tation and growth properties as the wild type. 

The S. cerevisiae integrants PK3, PK3(HO), PK3(ARS) and PK3(ADH4) may be improved for efficient assimilation 
of xylulose. 

55 



9 



EP 0 450 430 B1 



EXAMPLE 5 

Isolation of a S. cerevlsiae mutant efficiently assimilating xylulose. 

S. cerevisiae strain XJB3-1B grows slowly on media containing xylulose as a sole carbon source (doubling time 
10 hours). According to a protocol described by Porep (Porep, 1937) a mutant, PUA3, was isolated, which utilized 
xylulose more efficiently than wild type S. cerevisiae strains, resulting in a doubling time of approximately four hours 
for growth on xylulose as a sole carbon source. 

Mutant strain PUA3 also converts xylulose into ethanol in the absence of respiration (Porep, 1987). In order to 
obtain the PUA genotype in combination with an auxiliary marker (LEU2) useful in yeast transformation, strain PUA3 
was crossed to AH22 (Ieu2 his4). From a sporulating culture of the AH22xPU A3 diploid meiotic spore progenies were 
isolated which were ]eu2 and had the ability of efficient xylulose-utilization as observed in the original mutant, PUA3. 
In an analogous experiment the PUA genotype was combined with ]eu2 and his3 auxiliary markers by crossing strain 
GRF1 8 and PUA strain and subsequent meiotic spore isolation. This resulted in strain PUA6-1 which was PUA Ieu2his3 . 

EXAMPLE 6 

Isolation of a S. cerevlsiae mutant efficiently converting xylose Into ethanol. 



Strain PUA6-1 containing the XYL1 and XYL2 genes chromosomalry integrated (See Examples 3 and 4) was able 
to grow on xylose as a sole carbon source whereas the untransformed PUA6-1 strain was completely negative on YNB 
xylose media. Doubling time of the transformant strain PK3 was 4 hours on YNB 1% xylose (for comparison, doubling 
time on YNB 1% glucose: 2 hours). Since ethanol production was inefficient in this strain when grown on xylose and 
no xylose growth was observed in the absence of respiration a mutant strain with improved capability in converting 
xylose to ethanol was selected as follows: 1 0 8 PK3 cells were mutagenized with U V (254 nm) using conditions allowing 
20% to 40% of the cells survival. The surviving cells were grown for approximately 30 generations in YNB 2% xylose 
liquid media. After plating on xylose solid media isolates were obtained which grow significantly faster than the parent 
strain PK3. One isolate was further propagated and used for selection of a mutant able to grow on YNB 2% xylose 
plates supplemented with 2 mg/l antimyctn A in order to block respiratory metabolism. This procedure yielded a mutant 
(PK4) which was able to convert xylose significantly more efficiently to ethanol than the original transformant strain 
PK3. A typical xylose fermentation protocol is depicted in Fig. 4. The ethanol yield was approximately 40% of the initial 
xylose. This yield corresponds to approximately 30% of the theoretical maximum yield of ethanol from xylose conver- 
sion. 

EXAMPLE 7 

Expression of heterologous genes in Pichia stipitis 

Following UV mutagenesis of Pichia stipitis strain CBS 5773 (DSM 5855) a trp5 mutant was isolated. The trp5 
mutation was identified by examining indol accumulation according to Hagedom and Ciriacy (Hagedom and Ciriacy, 
1 9S9). 

For expression in Pichia stipitis plasmids were constructed which contain a replicon from Schwann iomvces occi- 
dental (SwARSt ), the TRP5-gene from S. cerevisiae (Dohmen et al., 1989) as a selective marker and in addition a 
glucoamylase(GAM)-cellulase (celD) gene fusion under control of the glucoamylase promoter. In a first step the 3.8 
kb EcoRI-Pvylt-f ragment from plasmid pBRSwARSGAM (Fig. 5. described in EP 89 1 07 780) was isolated and inserted 
into the 2296 bp EcoRI-Pvy 1 1 -fragment from pBR322 carrying the ampicillin resistance gene and the bacterial origin 
of replication, resulting in plasmid pBRGAM (Fig. 5). In addition to pBR322 derived sequences this plasmid carries 3.6 
kb derived from the 5* noncoding region of the glucoamylase gene from Schwann iomvces occidentalis and nucleotides 
1 to 208 coding for the N-terminal part including the signal sequence of the glucoamylase. Subsequently, a 3.4 kb 
Pyul I -fragment derived from plasmid pCT603 (Joliff et al., 1986) containing the coding region of the celD-genes from 
Clostridium therm ocellum with the exception of 120 bp (corresponding to 40 amino acids) starting with the 5' terminus 
of the coding region was inserted into the Pvyll site cf the pBRGAM resulting in pBRGCI (Fig. 6). For construction of 
a j^stipjtis expression vector plasmid pCJD5-1 (EP 37 110 370. 1 ) was cleaved with Bam HI/Pstl and ligated with a 6.5 
kb BamHI-Pstl-fragment from pBRGCI. The resulting plasmid was termed pMPGC1-2 (Fig. 7). The above described 
p - stipitis mutant trp5 was transformed with pMPGCI -2 and the transformants were identified by their capability to grow 
on medium free of tryptophan (tryptophan prototrophy). Transformants were examined for cellulase activity using the 
congo red assay (Teather & Wood, 1982). The transformants ccnstitutively produce active cellulase (endoglucanase 
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D) of Clostridium thermoceHum , which is secreted into the media, indicating, that the promoter and the signal sequence 
encoded by the glucoamylase gene may control expression of a heterologous gene and secretion of the gene product 
into the medium. 

Subsequently plasmid pMPGCl-2 was modified in order to substitute the glucoamylase promotor either by theS. 
5 cerevisiae ADH1 -promoter or the inventive 5' regions of the XYL1 or XYL2 gene, respectively. It could be shown, that 
the expression under control of the XYL1 or XYL2 promoter region may be induced by xylulose, while being repressed 
by glucose. 
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Claims 



Claims for the following Contracting States : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1 . DNA sequence, characterized in that said DNA sequence comprises a structural gene coding for a xylose reductase 
having the following amino acid sequence: 
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10 

MPSIKLNSGY 

5 

20 

DMPAVGFGCW 

30 

KVDVDTCS E Q 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



40 

IYRAIKTGYR 

50 

LFDGAEDYAN 

60 

EKLVGAGVKK 

70 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHPDNVEK 

100 

ALNRTLSDLQ 

110 

VDYVDLFLIH 

120 

FPVTFKFVPL 

130 

EEKYPPGFYC 

140 

GKGDNFDYED 
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30 
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150 

VPILETWKAL 

160 

EKLVKAGKIR 

170 

SIGVSNFPGA 

180 

LLLDLLRGAT 

190 

IKPSVLQVEH 

200 

HPYLQQPRLI 

210 

EFAQSRG IAV 

220 

TAYSSFGPQS 

230 

FVELNQGRAL 

240 

NTSPLFENET 

250 

IKAIAAKHGK 

260 

SPAQVLLRWS 

270 

SQRGIAIIPK 

280 

SNTVPRLLEN 
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15 



v ^ 290 
KDV NSPD.LDE 

300 

QDFADIAKLD 

310 

INLRFNDPWD 



WDKI PI'FV* 

20 wherein said DNA sequence is capable of expressing said polypeptide in a microorganism. 

2. The DNA sequence according to claim 1 , characterized in that said DNA sequence further comprises a structural 
gene coding for xylitol dehydrogenase having the following amino acid sequence: 

25 



30 



35 



40 



45 



50 



55 



16 



10 



1S 



20 



25 



30 



35 



40 



45 



SO 



55 
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10 

MTANPS LVLN 

20 

KIDDISFETY 

30 

DAPEISEPTD 

40 

VLVQVKKTG I 

50 

CGSDIHFYAH 

60 

GRIGNFVLTK 

70 

PMVLGHESAG 

80 

TVVQVGKGVT 

90 

SLKVGDNVAI 

100 

EPGIPSP. FSD 

110 

EYKSGHYNLC 

120 

PHMAFAATP-N 

130 

SKEGEPNPPG 
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140 

TLCKYFKSPE 

150 

DFLVKLPDHV 

160 

SLELGALVEP 

170 

LSVGVHASKL 

20 180 

GSVAFGDYVA 



10 



15 



25 



3S 



40 



45 



SO 



55 



190 

VFGAGPVGLL 



200 

" AAAVAKTFGA 

210 

KGVIVVDIFD 



220 

NKLKMAKDIG 

230 

A A T H T F N S K T 

240 

GGSEELIKAF 

250 

GGNVPNVVLE 

260 

CTGAEPCIKL 
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270 

GVDAIAPGGR 

280 

FVQVGNAAGP 

290 

VSFPITVFAM 

300 

KELTLFGSFR 

310 

YGFNDYKTAV 

320 

GI FDTNYQNG 

330 

RENAPI DFEQ 

340 

LITHRYK FKD 

350 

AIEAYDLVRA 

360 

GK GAVKCLID 



G P E * 



The DNA sequence according to claims 1 or 2, characterized in that said DNA sequence is derived from a yeast, 
preferably from a yeast selected from a group consisting of the genera Schwanniomvcas . Saccharomvces . Klu- 
v yeromyces , Pichia, Hansenula , Candida, Debarvomvces . Metschnikowia . and Pachysolen . 

The DNA sequence according to claim 3. characterized in that the yeast is Pichia stioitis . preferably Pichia stioitis 
CBS 5773 (OSM 5855). — 

The DNA sequence according to claim 1, comprising the following nucleotide sequence: 
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-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAC-TTTTCTTTTCATTTCTC 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

130 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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490 510 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

590 

CAC CC ATACTTGCAACAACCAAGATTGATC 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

730 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

7SC S10 
"T C C C AAA G A G G C A T 7 G C C A T C A T T C C AAA G 

830 

TCCAACACTGTCCCAAGATTGTTGGAAAAC 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
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910 930 
ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 99 0 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 105 0 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTT C AATT C AAAC CATGT CTG C CAAGG 

1210 1230 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAJ^TCCAAAAATAAAAAACGTCATTGTC 

1270 12 90 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTG CC C AAGTTG AGC AACTTT AATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

lg 3C i 6 5o 
T CTGG A GTG G AAA C AT CT C C A GTTGT C AAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 
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6. The DNA sequence according to claim 2, comprising the following nucleotide sequence 

5 

10 

15 
20 
25 
30 
35 
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40 
45 
SO 
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-310 -290 
TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -no 
TT CT C C AG CTTTT ATT AT AAAAGG AG CC AT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 

TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTC-TTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 
GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

690 

AC CTT CAACTC CAAGACCGGTGGTTCTGAA 

710 730 
G AATT G AT C AA G G CTTT C G GTG G TAA C GTG 

750 

C C C C- 7 C G T T ^ ~ G — ~ - "rr.-r i rT'-.^-'rir 0 

770 790 
G AA C CTTGTAT CAAGTTGGGTGTTGACGCC 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

930 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

1110 

GTCAACCGCTTGGCTGGC.CCAAAGTGAACC 

1130 1150 
A G AAA CG AAAA T G A T T A 7 C AAA T A G CTTTA 

1170 

TAG A C CTTT ATC G AAATTT ATGT AAA CTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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1250 



GTGAAGTCTTGGTCGGAACAAACAAACAAA 



1290 



AMATATTTTCAGCAAGAGTTGATTTCTTT 



1310 



tctggagaitttggtaattgacagIgmcc 
ctttctgctattgccItctaaacatccto 

1370 

CCTAGT 



C 



gaatagaactttactggatggccg 13 9 ° 



gttgagtatatatta^c^accaaaatcctg 
tatatIgtctctgaaaaatttgactItcct 

1470 

aacttaacaaaaga 



gagcaccataatgcaagc 



1490 



1510 



tcatagttcttagagacaccaactatactt 
agccaaacaaaatgtcc<?tggcctctaaag 

1550 

A ir.n m ^r * -. 1570 

u ^- — AAGTTGC 

ac^cttcttcatcmgtttacccccagac 
cgtttgccgaatattcggaaaagccttcga 

CTATAGTGGATCC 
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ttuainil gene «MM by aid ONA sequence i„ , ^ mfc^SST^ ^ """* SS ™ °' * 

or xylitol dehydrogenase activity functional enzyme having xylose reductase 



,2 - ssssssssr" " ■ charaMe,s * < ' in *" - dna ««— - - - 

ADH1, ADH2, PDC, GALl/10. 

said structural gene. Promoter, leading to over expression of the protein encoded by 

es acceding ^ of S^Ta ^STdna^ZI sequences comprising Ihe DNA sequenc- 

:;~^™^.rcrr^^ 

''•^^"^cec^g.ociaim.&eha^^ 

2 °' ^ to00,Sa " is ™ » «. «n.«c MM », ma , said m ie,oorganisr„ . Mtt^a^ 
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26. The method according to claim 24 or 25 rh* ♦ ■ ^ 

-yfansse and*, x^o**. Mvl £ • mauce " " m "MM conttlnha K» xylos., „,„,.„„; „ ls> "^ ™ 
Claims for the following Contracting State : ES 
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20 
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40 
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MPSIKLNSGY 

20 

DM PA V G F G C W 

30 

KVDVDTCSEQ 

40 

I Y R A IKTGYR 

LFDGAEDYAN 

60 

EKLVGAGVKK 

7 0 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHPDNVEK 

100 

ALNRTLS DLQ 

110 

VDYVDLFLIH 



FPVTFKFVPL 
EEKYPPgf Y 13 ° 

GKGDNFDYED 
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„ „ „ 150 

VPILETWKAL 

160 

EKLVKAGKI R 

„ „ 170 
SIGVSNFPGA 

is 180 

LLLDLLRGAT 



10 



190 

IKPSVLQVEH 

200 

HPYLQQp RLl 

EFAQSRGIAV 

220 

TAYSSFGPQS 

3s 230 

FVELNQGRAL 



20 



25 



30 



240 

NTSPLFENET 

250 

IKAIAAKHGK 

260 

SPAQVLLRWS 

270 

SQRGIAI IPK 

ss 280 

SNTVPRLLEM 



40 



45 



50 
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KDVNSPD.LD 



290 



300 



QDPADIAKLD 



INLRFNDPW 



310 



WDKI PI'fv* 



35 



75 



20 



25 



30 



35 



40 



45 



SO 
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MTANPSLVL^ 

20 

kiddisfety 

r> , 30 
DAPEISEPTD 

VLVQVKKTGI 

50 

CGSDIHFYAH 

r T> -r 60 

GRIGNFVLTK 

PMVLGHESA 7 G 

TVVQVGKGV 8 T 

90 

SLKVGDNVAI 

- „ ^ 100 
^ P G I p S ?. F S D 

EYKSGHYN i/^ 
P H M A F A A T P^N 



130 

SKEGEPNPPG 
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140 

TLCKYFKSPE 

150 

DFLVKLPDHV 

„ „ 160 
SLELGALVEP 

170 

LSVGVHASKL 

180 

GSVAFGDYVA 

2S 190 

VFGAGPVGLL 



10 



15 



20 



30 



35 



45 



50 



55 



200 

AAAVAKTFGA 

210 

KGVIVVDIFD 

220 

NKLKMAKDIG 



220 

A A T H 7 F N S K T 

240 

GGSEELIKAF 

250 

GGNVPNVVLE 

260 

CTGAEPCIKL 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



G V D A I A p G G 2 ^0 
F V Q V G K A A G 28 p 

T v F A M 

F G S F R 
310 

Y K T A V 
320 

Y Q N G 
I D F E 
^ K F K 
Y D L V R 

v K C L I 

G P E * 



V S F p I - ' - 290 



K E L T L - - 300 



* G F N D " - 310 



G I F D T N 320 
R E N A P - 330 

L I T H R - 340 



A I E A " - 350 



G K G A " - 360 



3 Tho 

4 



LSti£itis 5773 
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GTTGGO 



-290 CCCCAcc 



^^cg^, 



l ^ATgg 



- i9 GA TTCTCCG 



Gtt ac 

g ac ATgccagccg ^ 50 



CGGTTTCG 



GCTGTTGG 
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490 510 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

590 

CACCCATACTTGCAACAACCAAGATTGATC 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

730 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

79C 810 
TCCC AAA G A G C- CATTGCC A TCATTCC AAA G 

830 

TC C AA CA CTGT C C C AAG ATTGTTG G AAAAC 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
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910 930 

ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 1230 
AATCAAA7G AG ATTTTTTTCG C AG C CAAAC 

1250 

TTGAATCCAA.\AATAAAAAACGTCATTGTC 

1270 1290 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTG C C C AAGTTG AGC AACTTT AATTTAG A 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

AC AC ATT CTTCT CTTG C C CGTG AACTCTGT 

163C 1650 
T C T G G A G T C- G AAA CATCTCCAG7TGTC AAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 
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6. The method according to claim 2, wherein the DNA sequence comprises the following nucleotide sequence: 
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-310 -290 
TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -HO 
TT CT CC AGCTTTTATT ATAAAAGG AG CC AT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 
TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

-10 10 
A C T CT T C A C A C AA. T T AAAA TGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 

GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 

710 730 
GAATTGATCAAGGCTTTCGGTGGTAACGTG 

7 50 

770 790 
G.\ACCTTC-TATCAAGTTGGGTGTTG.-.CGCC 

810 

ATTGC C C C AGGTGGTCGTTT CGTTC AAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

930 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

'mo 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 1130- 
^ / "i — — ~ -^^z i — — z ~~*(~~ ^ ^ "* ^ — ^ CC^T'^ *^ 

1170 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

1310 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

1370 1390 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

14S0 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

,1530 

AGGCAAACAAAATGTCCTTGGC CTCTAAAG 
1550 1570 

AAuvr. _ - t_ % - — \ — > L^^^.-.'^rL^u • 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 
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7 A method for preparing a combination of DNA sequences, said method comprising combining a first DN A sequence 
obtainable according to any of claims 1 to 6 and one or more further DNA sequences capable of regulat.ng the 
expression ol a structural gene encoded by said DNA sequence in a host microorganism in a manner known per se. 

8 The method according to claim 7, wherein said combination of sequences comprises modifications of the DNA 
sequences retaining their capability to express a functional enzyme having xylose reductase or xylitol dehydroge- 
nase activity. 

9 The method according to any of claims 7 or 8, wherein said structural gene contains DNA sequences derived from 
the structural gene coding for xylose reductase or xylitol dehydrogenase which modify said protein product while 
retaining its functions in such a way that said protein product is expressed as a gene product having enzymatic 
activity. 

10. The method according to any of claims 7 to 9, wherein said DNA sequences capable of regulating the expression 
of said structural gene in a host microorganism are derived from said host microorganism. 

1 1 . The method according to claim 1 0, wherein said DNA sequences capable of regulating the expression are inducible 
promoters. 

1 2. The method according to claim 1 1 . characterized in that said DNA sequences capable of regulating the expression 
are selected from the following promoters: 

ADH1 . ADH2 , PDC , GAL1/10 . 

1 3 The method according to any of claims 1 0 to 1 2, wherein said DNA sequences capable of regulating the expression 
of said structural gene is a strong promoter, leading to over expression of the protein encoded by said structural 
gene. 

14 The method for preparing a vector, said method comprising inserting a DNA sequence obtainable according to 
any of claims 1 to 6 or a combination of DNA sequences obtainable according to any of claims 7 to 13 into a host 
plasmid. 

1 5 The method according to claim 1 4, characterized in that it produces a vector selected from the group comprising 
the plasmids pRI, pR2, pD1, pD2, pRDI. pXRa, pXRb, pXDH. pXR, pXDH-HIS3, pXR-LEU2. 

1 6 A method for preparing a microorganism being capable of expressing a xylose reductase or xylose reductase and 
xylitol dehydrogenase, wherein DNA sequences comprising the DNA sequences obtainable according to any of 
claims 1 to 6 or a combination of DNA sequences obtainable according to any of claims 7 to 13, coding for said 
xylose reductase or said xylose reductase and said xylitol dehydrogenase, are introduced into a host microorgan- 
ism. 

1 7 The method according to claim 1 6, characterized in that said host microorganism is selected from a group consisting 
of yeast of the genera Saccharomvces , Schizosaccharomyces , Schwanniomvces, Kluyveromvces, Pichia, 
Hansenula , Candida , Debarvomyces , Metschnikowia , Pachvsolen , or Paecilomyces or bacteria of the genus Zfc 
momonas . 

18. The method according to claim 17, characterized in that said microorganism is Saccharornyces cerevisiae . 

19. The method according to claim 17, characterized in that said microorganism is Schizosaccharomyces pombe. 

20. The method according to any of claims 16 to 19, characterized in that said DNA sequence or combination of DNA 
sequences is integrated into the genome of said microorganism. 

21. The method according to any of claims 17 to 21, characterized in that said microorganism is useful in biomass 
production, in food industry and fermentation processes. 

22. The method according to claim 21, characterized in that said microoganism is useful for fermentation of xylose 
into ethanol. 
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23. A method (or producing xylose reductase or xylose reductase and xylitol dehydrogenase by cultivating a microor- 
ganism obtainable according to any of claims 16 to 20 under suitable conditions and recovering said enzyme(s) 
in a manner known per se. 

s 24. The method according to claim 23, characterized in that said microorganism is selected for efficient fermentation 
of xylulose. 

25. The method according to claim 23 or 24, characterized in that said microoganism has received said DNA sequences 
or said combination of DNA sequences by transformation using a vector, said vector being preferably a DNA 

*o fragment or a plasmid. 

26. The method according to claim 25, characterized in that said vector contains DNA, which is homologous to DNA 
of said microorganism, leading to integration into the genome of said microorganism. 

1$ 27. An ethanol manufacturing process, characterized in that a microorganism obtainable according to any of claims 
16 to 22 is used. 

28. A process according to claim 27, characterized in that the fermentation process is adapted for the production of 
alcoholic beverages or single cell protein produced from substrates containing free xylose, preferably released by 

20 xylanase and/or xylosidase activity. 

29. A process for production of biomass, characterized in that a host microorganism according to any of claims 16 to 
22 is used. 
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Patentanspruche 

Patentanspruche f Or folgende Vertragsstaaten : AT, BE, CH f LI, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1. DNA-Sequenz, dadurch gekennzeichnet, daft die DNA-Sequenz ein Strukturgen umfafti das fur eine Xylosere- 
duktase mit der folgenden Aminosauresequenz kodiert: 
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10 

MPSIK LNSGY 

20 

DM.PAVGFGCW 

30 

KVDVDTCS EQ 

40 

IYRAIKTGYR 

50 

LFDGAEDYAN 

60 

EKLVGAGVKK 

70 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHPDNVEK 

100 

ALNRTLS DLQ 

110 

VDYVDLFLIH 

120 

FPVTFKFVPL 

so 13 0 

EEKYPPGFYC 



10 
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140 

GKGDNFDYED 
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150 

VPILETWKAL 

160 

EKLVXAGKIR 

10 170 

SIGVSNFPGA 



180 

LLLDLLRGAT 

190 

IKPSVLQVEH 

200 

HPYLQQPRLI 

210 

EFAQSRG IAV 

so 220 

TAYSSFGPQS 



15 



20 



25 
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45 



230 

FVELNQGR AL 



240 

MTSPLFENET 

250 

IKAIAAKHGK 

260 

SPAQVLLRWS 

so 270 

SQRGIAI IPK 
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280 

SNTVPRLLEN 
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290 

KDV NSPD.LD E 

300 

QD FAD.IAKLD 

- „ „ 310 
INLRFNDPWD 



15 



WDKiPi'pv* 

» wobei die DNA-Sequenz das Polypeptid in einem Mikroorganismus exprimieren kann. 
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10 

M TANPS LVLN 

20 

KIDDISFETY 

30 

DAPEISEPTD 

40 

VLVQVKKTGI 

50 

CGSDIHFYAH 

60 

GRIGNFVLTK 

70 

PMVLGHESAG 

80 

TVVQVGK. GVT 

90 

SLKVGDNVAI 

100 

EPGIPSF. FSD 

110 

EYKSGHYNLC 

120 

PHMAFAATPN 

130 

SKEGEPNPPG 
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140 

TLCKYFKSPE 

150 

DFLVKLPDHV 

160 

SLELGALVEP 

170 

LSVGVHASKL 

180 

GSVAFGDYVA 

190 

VFGAGPVGLL 

200 

AAAVAKTFGA 

210 

KGVIVVD' IFD 

220 

NKLKMAKDIG 

220 

AATHT-FNSKT 

240 

GGSEELIKAF 

» 250 

GGNVPNVVLE 



75 
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CT GAEPCIKL 
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270 

GVDAIAPGGR 

280 

FVQVGNAAGP 

290 

VSFPITVFAM 

300 

KELTLFGS FR 

310 

YGFNDYKTAV 

320 

GIFDTNYQNG 

330 

RENAPIDFEQ 

340 

LITHR.YKFKD 

350 

AIEAYDLVRA 

360 

GKGAVKCLID 



G P E * 



DNA-Sequenz nach Anspruch 1 Oder 2. dadurch gekennzeichnet. daS die DNA-Sequenz von einer Hefe abgeleitet 
*l. bevorzugt von einer Hefe. die aus der aus den Galtungen Schwanniomvces Saceharomvces . Kluveromvces. 
Hjchjg. Hansenula, Candjda, Debaryomyces , Melschnikowia und Pachygoign bestehenden Gruppe ausgewahlt ist. 

CBS^ q 3pSM a S5^ SPrUCh 3 ' dadU,Ch 9ekenn29ichnet - da8 die He,e PJehiastipjtis ist. bevorzugt Pichia stipitis 

DNA-Sequenz nach Anspruch 1, umfassend die folgende Nukleotidsequenz: 
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-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAGTTTTCTTTTCATTTCTC 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 
AAAGTCGACGTCGACAC CTGTTCTG AAC AG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 . .. 330 

GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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490 



TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 - 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

590 

CACCCATACTTGCAACAACCAAGATTGATC 
610 

GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 

itcgttgII^tgaaccaaggtagagcttog 

*7 X 0 

aacacttctccattgttcgagaacgaaact 
atcaaggctatcgctgctaagcacggtIIg 
tctccagctcaagtcttgttgagatggtct 

790 S10 
"CCCAAAGAGGCATTGCCATCATTCCAAAG 

830 

tccaacactgtcccaagattgttggaaaac 

850 87Q 

aaggacgtcaacagcttcgacttggacgaa 

890 

caagatttcgctgacattgccaagttggac 
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910 930 
ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 1230 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAATCCAAAAATAAAAAACGTCATTGTC 

1270 1290 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

1630 1650 
TCTGGAGTGGAAACATCTCCAGTTGTCAAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 
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6. ONA-Sequenz nach Anspruch 2, umfassend die folgende Nukleotidsequenz: 

5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
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* 310 -290 
TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

" 250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

" 190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

"* 130 -110 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

~ 70 ' ' "50 

TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

" i0 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 • • 310 

CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 430 
TACTTCAAGTCGC CAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 

GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TT CGGCGACTACGTTG CCGTCTTTGGTG CT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

690 

AC CTTCAACT C CAAGACCGGTGGTTCTG AA 

"710 730 
GAATTGATCAAGGCTTTCGGTGGTAACGTG 

750 

G C AAAC GTCG T1"I7 3 G AATGT A CTGGTG CT 

770 790 
G AACCTTG T ATCAAG7TGGGTGTTG ACGCC 

810 

ATTGCC C C AG GTGGTCGTTTCGTTC AAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

93Q 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

1110 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 H50 
^iiir^ti - - - ™*G - " rr "' iTQ - - - AG CTTTA. 

1170 

TAGACCTTTATCGAAATTTATGTAAACTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

_ "10 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

1370 * 1390 

GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
^•GCATTCAGCAAGCTTCCCCAGAAGTTGC 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 
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7. DNA-Sequenz nach einem der Anspruche 1 bis 6, dadurch gekennzeichnet, daB sie durch rekombtnante DNA- 
Technologie aus naturlicher DNA und/oder cDNA und/oder chemisch synthetisierter DNA erhalten wtrd. 

8. Kombination von DNA-Sequenzen, dadurch gekennzeichnet, daB die Kombinantion eine erste DNA-Sequenz ge- 
s maB einem der Anspruche 1 bis 7 und eine Oder mehrere DNA-Sequenzen umfaBt, die die Expression eines von 

der DNA-Sequenz kodierten Strukturgenes in einem Wirtsorganismus regulieren kann. 

9. Kombination von DNA-Sequenzen nach Anspruch 8, dadurch gekennzeichnet, daB die Kombination Modifikatio- 
nen der DNA-Sequenzen umfaBt, die ihre Fahigkeit zur Expression eines funktionellen Enzymes mit Xybsereduk- 

io tase- oder Xytitoldehydrogenase-Aktivitat autrecht erhalt. 

10. Kombination von DNA-Sequenzen nach Anspruch 8 oder 9, dadurch gekennzeichnet. daB das Strukturgen DNA- 
Sequenzen enthatt, die von dem fOr Xylosereduktase oder Xylitoldehydrogenase kodierenden Strukturgen abge- 
leitet sind, die das Proteinprodukt modifizieren, wan rend seine Funktionen auf eine solche Weise beibehalten 

is werden, daB das Proteinprodukt als ein Genprodukt mit enzymatischer Aktivitat exprimiert wird. 

11. Kombination von DNA-Sequenzen nach einem der Anspruche 8 bis 10, dadurch gekennzeichnet, daB die DNA- 
Sequenzen, die Expression des Strukturgenes in einem Wirtsorganismus regulieren konnen, von dem Wirtsorga- 
nismus abgeleitet sind. 

20 

12. Kombination nach Anspruch 11 , dadurch gekennzeichnet, daB die DNA-Sequenzen, die die Expression regulieren 
konnen, induzierbare Promotoren sind. 

13. Kombination nach Anspruch 12, dadurch gekennzeichnet, daB die DNA-Sequenzen, die die Expression regulieren 
25 konnen, aus den folgenden Promotoren ausgewahlt sind: 

ADH1 , ADH2 . PDC , GAL1/10 . 

14. Kombination nach einem der Anspruche 11 bis 13, dadurch gekennzeichnet, daB die DNA-Sequenz, die die Ex- 
pression des Strukturgenes regulieren kann, ein starker Promotor ist, was zur Uberexpression des von dem Struk- 

30 turgen kodierten Proteines fuhrt. 

15. Vektor, dadurch gekennzeichnet, daB der Vektor eine DNA-Sequenz nach einem der Anspruche 1 bis 7 oder eine 
Kombination von DNA-Sequenzen nach einem der Anspruche 8 bis 14 umfaBt. 

35 16. Vektor nach Anspruch 15, dadurch gekennzeichnet, daB der Vektor aus der Gruppe ausgewahlt ist, die die Ptas- 
mide pR1, pR2/pD1, pD2, pRD1, pXRa, pXRb, pXDH, pXR, pXDH-HIS3, pXR-LEU2 umfaBt. 

17. Mikroorganismus, dadurch gekennzeichnet, daB der Mikroorganismus eine Xylosereduktase oder Xylosereduk- 
tase und Xylitolhydrogenase mittels rekombinanter DNA-Technologie exprimieren kann als Ergebnis dessen, daB 

to er DNA-Sequenzen erhalten hat, die DNA-Sequenzen nach einem der Anspruche 1 bis 7 oder eine Kombination 

von DNA-Sequenzen nach einem der Anspruche 8 bis 14 umlassen, die fur die Xylosereduktase oder die Xylo- 
sereduktase und Xylitolhydrogenase kodieren. 

18. Mikroorganismus nach Anspruch 17, dadurch gekennzeichnet, daB der Mikroorganismus aus einer Gruppe aus- 
45 gewahlt ist, die aus Hefe der Gattungen Saccharomyces . Schizosaccharomvces , Schwanniomvces . Kluwero- 

mvces , Pichia , Hansen ula , Candida , Debarvomyces , Metschnikowia . Pachysolen oder Paecilomvces oder Bak- 
terien der Gattung Zymomonas besteht. 

19. Mikroorganismus nach Anspruch 18, dadurch gekennzeichnet, daB der Mikroorganismus Saccharomyces cere- 
50 visiae ist. 

20. Mikroorganismus nach Anspruch 18, dadurch gekennzeichnet, daB der Mikroorganismus Schizosaccharomvces 
pombe ist. 

55 21 . Mikroorganismus nach einem der Anspruche 1 7 bis 20, dadurch gekennzeichnet, daB die DNA-Sequenz oder eine 
Kombination von DNA-Sequenzen in das Genom des Mikroorganismus integriert ist. 

22. Mikroorganismus nach einem der Anspruche 17 bis 21, dadurch gekennzeichnet, daB der Mikroorganismus fur 
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die Erzeugung von Biomasse, in der Nahrungsmittelindustrie oder in Fermentationsverfahren nOtzlich ist. 
24. Verfahren zum Erzeugen von Xylosereduktase oder Xylosereduktase und Xylitoldehydrooenase durch Kultivi*r*n 

26. Vferfahrer , nach Anspruch 24 oder 25, dadurch gekennzeichnet. daf} der Mikroorganismus die DNA-Seouenzen 

28 ' SSS!SrSS h ^ ^ ** ei " ^-ganisrnus gemaB einem der AnsprOcne 

29 ' amohShTn r A , n T Ch i 9 ' c adUrCh 9 ekennzeichnet - *8 das Fermentationsverfahren an die Erzeugung von 
T k ^ Einze,2ell P f0tei " "WaUt ist, die aus Subs.raten erzeugt werden die frei Xvlo!e 

enthalten, d,e bevorzugt durch Xylanase- und/oder Xylosidase-Aktivitat freigesetzt wird * 

Cr^ 7 m b r^ ^ - * Wi.sorganismus gem, 8 einem der 

Patentanspruche fur folgenden Vertragsstaat : ES 
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10 

MPSIK LN6GY 

20 

DMPAVGFGCW 



30 



KVDVDTCSEQ 



IS 



40 

IY RAIKTGYR 



20 



50 

LFDGAEDYAN 

60 

EKLVGAGVKK 



2S 



70 

AIDEGIVKRE 



30 



80 

DLFLTSKLWN 



35 



90 

NYHHPDNVEK 

' 100 

ALNRTLS DLQ 



.40 



110 

VDYVDLFLIH 



45 



120 



FPVTFKFVPL 



50 



EEKYPPGF 



130 
Y C 



55 



140 



GKGDNFDYED 
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150 

VP ILET WKA L 

160 

EKLVKAGKI R 

170 

SIGVSNFPGA 

180 

LL LD LLR G A T 

190 

IKPSVLQVEH 

200 

HPYLQQPRLI 

210 

EFAQSRGIAV 

220 

TAYSSFGPQS 

230 

FVELNQGRAL 

240 

NTSPLFENET 

250 

IKAIAAKHGK 

45 260 

SPAQVLLRWS 

270 

so SQRGIAIIPK 

280 

SNTVPRLLEN 

55 



10 



IS 



20 



25 



30 



35 



40 
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290 

KDVNSPD.LDE 

300 

QD FADIAKLD 

310 

INLRFND PWD 



WDKIPI'FV* 



wobei die DNA-Sequenz das Polypeptid in einem Mikroorganismus exprimieren kann, wobei die DNA-Sequenz 
mittels rekombinanter DNA-Techologie aus naturlicher DNA und/oder cDNA und/oder chemisch synthetisierter 
DNA hergesteltt wird. 

Vertahren nach Anspruch 1 , wobei die DNA-Sequenz weiter ein fur Xylitoldehydrogenase mit der folgenden Ami- 
nosauresequenz kodierendes Strukturgen umfaRt: 
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10 

MTANPSLVLN 

20 

KIDDISFET Y 

30 

DAPEISEPTD 

« 40 

VLVQVKKTGI 



70 



50 

CGSDIHFYAH 

60 

GRIGNFVLTK 

70 

PMVLGHESAG 

80 

TVVQVGKGVT 

90 

SLKVGDNVAI 

40 100 

EPGIPSRFSD 



20 



25 



30 



35 



45 



SO 



55 



110 

EYKSGHYNLC 

120 

PHMAFAATPN 

130 

SKEGEPNPPG 
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140 

T LCKY FKSPE 

150 

DFLVKLPDHV 

160 

SLELGALVEP 

is 170 

LSVGVHASKL 



10 



20 



25 



30 



35 



40 



45 



180 

GSVAFGDYVA 

190 

VFGAG PVGLL 

200 

AAAVAKT FGA 

210 

KGVIvyD, IFD 

220 

NKLKMAKDIG 

220 

A A T H T F i," S K T 

240 

GGSEELIKAF 



250 

so GGNVPNVVLE 

260 

S5 CTGAEPCIKL 
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270 
G R 

280 
G P 

290 
A M 

300 
F R 

310 
A V 

320 
N G 

330 
E Q 

340 
K D 

350 
R A 

360 
I D 



G P E * 



Verfahren nach einem der Anspruche 1 oder 2, dadurch gekennzeichnet, daB die DNA-Sequenz von einer Hefe 
abgeleitet ist, bevorzugt von einer Hefe, die aus der Gruppe ausgewahit ist, die aus den Gattunqen Schwannio- 
myces, Saccharomvces . Kluweromvces . Pichia . Hansenula . Candida , Debaryomyces , Metschnikowia und Pa- 
chysolen besteht. 

Verfahren nach Anspruch 3, dadurch gekennzeichnet, daf3 die Hefe Pichia stipitis ist, bevorzugt Pichia stipitis CBS 
5773 (DSM 5855). 

Verfahren nach Anspruch 1, wobei die DNA-Sequenz die folgende Nukieotidsequenz umfaBt: 



G VDAIAPG 
FVQVGNAA 
VSFPIT VF 
KELTLFGS 
YGFNDYKT 
GIFDTNYQ 
RENAPIDF 
LITHRYKF 
AIEAYDLV 
GKGAVKCL 
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-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

"0 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
G AAC- AAAAGTAC C C AC C AGG ATT CT ACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 



79 



EP 0 450 430 B1 



490 510 

TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

590 

CAC CCATACTTGCAACAACCAAGATTGATC 

610 630 
GAATTCGCTCAATCCCGTGGTATTG CTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT • 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

730 750 
ATCAAGGCTATCGCTGQTAA.GCACGGTAAG 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

79C 810 
T CC CAAAGAGG CATTGCC AT C ATT C CAAAG 

830 

TCCAA C ACTGT CC CAAG ATTGTTG G AAAA C 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAG ATTT CG CTG ACATTG C CAAGTTGGA C 
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910 930 

ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAG7AGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

• • » 

v 1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 1230 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAATCCAAAAATAAAAAACGTCATTGTC 

1270 1290 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

163C 1650 
? CTGG AGTG G AAA C AT CT C CAGTTGT C AAA 

1670 

TATCAAACAGTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 
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6. Verfahren nach Anspruch 2, wobei die DNA-Sequenz die folgenden Nukleotidsequenz umfaGt: 

5 

10 

1$ 

20 
25 
30 
35 
40 
45 
SO 
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-310 -290 
TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -110 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 ' -50 

TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGTCACTACXACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 

GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTG ACATTTT CGACAACAAGTTGAAG 

650 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

690 

ACCTTCAACTCCAAGAGCGGTGGTTCTGAA 

710 *" 730 

GAATTGATCAAGGCTTTCGGTGGTAACGTG 

750 

£^ — — — t - * / t r "' % c rp::rr "'tr. > > ~ t* ^ £ r rfzn' rmi (Z£ rri 

770 790 
GAACCTTGTATCAAGTTGGGTGTTGACGCC 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

93Q 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

'lllO 

G7CAACCGCTTGGCTGGCCCAAAGTGAACC 
1130 1150 

1170 

TAGACCTTTATCGAAATTTATGTAAACTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

1310 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

1370 1390 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

•1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
AAG C ATT C AGC AAG CTT C C C C AG AAGTTG C 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 
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7. Verfahrert zum Herstellen einer Kombination von DN A-Sequenzen, wobei das Verfahren das Vereinen einer ersten 
DN A-Sequenz, erhaltlich gemaB einem der Anspruche 1 bis 6, und einer oder mehrerer weiterer DNA-Sequenzen, 
die die Expression eines von der DNA-Sequenz kodierten Strukturgenes in einem Wirtsorganismus regulieren 
konnen, in an sich bekannter Weise umfaBt. 

8. Verfahren nach Anspruch 7, wobei die Kombination von Sequenzen Modifikationen der DNA-Sequenzen umfaBt, 
die deren Fahigkeit zur Expression eines f unktionellen Enzymes mit Xylosereduktase- oder Xylitoldehydrogenase- 
Aktivitat aufrecht erhalt. 

9. Verfahren nach einem der Anspruche 7 oder 8, wobei das Strukturgen DNA-Sequenzen enthaft, die von dem fur 
Xylosereduktase oder XylrtokJehydrogenase kodierenden Strukturgen abgeleitet sind, die das Proteinprodukt mo- 
difizieren, wahrend sie seine Funktionen auf eine solche Weise aufrecht erhalten, daB das Proteinprodukt als ein 
Genprodukt mit enzymatischer Aktivitat exprimierl wird. 

10. Verfahren nach einem der Anspruche 7 bis 9, wobei die DNA-Sequenzen, die die Expression des Strukturgenes 
in einem Wirtsorganismus regulieren konnen, von dem Wirtsorganismus abgeleitet sind. 

11. Verfahren nach Anspruch 10, wobei die DNA-Sequenzen, die die Expression regulieren konnen, induzierbare 
Promotoren sind. 

12. Verfahren nach Anspruch 11, dadurch gekennzeichnet : daft die DNA-Sequenzen, die die Expression regulieren 
konnen, aus den folgenden Promotoren ausgewahlt sind: 

ADH1 , ADH2 . PDC . GAL1/10 . 

1 3. Verfahren nach einem der Anspruche 1 0 bis 1 2, wobei die DNA-Sequenzen, die die Expression des Strukturgenes 
regulieren konnen, starke Promotor sind, was zur Oberexpression des von dem Strukturgen kodierten Proteines 
fuhrt. 

14. Verfahren zum Herstellen eines Vektors, wobei das Verfahren das Insertieren einer DNA-Sequenz, die gemaB 
einem der Anspruche 1 bis 6 erhaltlich ist, oder einer Kombination von DNA-Sequenzen, die gemaB einem der 
Anspruche 7 bis 13 erhaltlich ist, in ein Wirtsplasmid umfaBt. 

15. Verfahren nach Anspruch 14, dadurch gekennzeichnet, daB es einen Vektor erzeugt, der aus der Gruppe ausge- 
wahlt ist, die die Plasmide pR1 , pR2. pD1 , pD2, pRD1 , pXRa, pXRb, pXDH, pXR, pXDH-HIS3, pXR-LEU2 umfaBt. 

16. Verfahren zum Herstellen eines Mikroorganismus, der Xylosereduktase oder Xylosereduktase und Xylrtolhydro- 
genase exprimieren kann. wobei DNA-Sequenzen, die die gemaB einem der Anspruche 1 bis 6 erhaltiichen DNA- 
Sequenzen oder eine Kombination von DNA-Sequenzen, die gemaB einem der Anspruche 7 bis 13 erhaltlich ist, 
und die fur die Xylosereduktase oder die Xylosereduktase und Xylitolhydrogenase kodieren, umfassen, in einen 
Wirtsmikroorganismus eingefuhrt werden. 

17. Verfahren nach Anspruch 16, dadurch gekennzeichnet. daB der Wirtsmikroorganismus aus einer Gruppe ausge- 
wahlt ist. die aus Hefen der Gattungen Saccharomvces . Schizosaccharomvces . Schwanniomyces . Kluvvero- 
m Y ces * Pichia. Hansenula . Candida . Debaryomyces . Metschnikowia . Pachysolen oder Paecilomvces oder Bak- 
terien der Gattung Zymomonas besteht. 

18. Verfahren nach Anspruch 17, dadurch gekennzeichnet, daB der Mikroorganismus Saccharomvces cerevisiae ist. 

19. Verfahren nach Anspruch 17, dadurch gekennzeichnet, daB der Mikroorganismus Schizosaccharomvces pombe 
ist. 

20. Verfahren nach einem der Anspruche 16 bis 19, dadurch gekennzeichnet, daB die DNA-Sequenz oder eine Kom- 
bination von DNA-Sequenzen in das Genom des Mikroorganismus integriert wird. 

21. Verfahren nach einem der Anspruche 17 bis 21, dadurch gekennzeichnet, daB der Mikroorganismus bei der Er- 
zeugung von Biomasse, in der Nahrungsmittelindustrie oder bei Fermentation sverfahren nutzlich ist. 

22. Verfahren nach Anspruch 21 , dadurch gekennzeichnet, daB der Mikroorganismus fur die Fermentation von Xylose 
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in Ethanol nutzlich ist 



23. Verfahren zum Erzeugen von Xylosereciuktase Oder Xylose reduktase und Xylitoldehydrogenase durch Kultivieren 
eines Mikroorganismus, der gemaB einem der Anspruche 16 bis 20 erhaltlich ist, unter geeigneten Bedingungen 

s und Gewinnen des Enzymes (der Enzyme) in an sich bekannter Weise. 

24. Verfahren nach Anspruch 23, dadurch gekennzeichnet t daft der Mikroorganismus fur effiziente Fermentation von 
Xylulose ausgewahlt wird. 

to 25. Verfahren nach Anspruch 23 Oder 24, dadurch gekennzeichnet, daB der Mikroorganismus die DNA-Sequenzen 
Oder die Kombination von DNA-Sequenzen durch Transformation unter Verwendung eines Vektors erhalten hat, 
wobei der Vektor bevorzugt ein DNA-Fragment Oder ein Plasmid ist. 

26. Verfahren nach Anspruch 25, dadurch gekennzeichnet t daB der Vektor DNA enthalt, die der DNA des Mikroorga- 
is nismus homolog ist, was zur Integration in das Genom des Mikroorganismus fuhrt. 

27. Ethanol-Herstellungsverf ahren, dadurch gekennzeichnet, daB ein nach einem der Anspruche 1 6 bis 22 erhaltlicher 
Mikroorganismus verwendet wird. 

20 28. Verfahren nach Anspruch 27, dadurch gekennzeichnet, daB das Fermentationsverfahren an die Erzeugung von 
alkoholischen Getranken oder Einzelzellprotein angepaBt ist, die aus Substraten erzeugt werden, die freie Xylose 
enthalten, die bevorzugt durch Xylanase und/oder Xytosidase-Aktivitat freigesetzt wird. 

29. Verfahren zum Erzeugen von Biomasse, dadurch gekennzeichnet, daB ein Wirtsorganismus nach einem der An- 
25 spruche 1 6 bis 22 verwendet wird. 



Revendications 

30 



Revendications pour les Etats contractants suivants : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1. Sequence d'ADN, caracteris^e en ce que ladite sequence d'ADN comprend un gene de structure codant pour une 
reductase du xylose ayant la sequence suivante d'acides amines : 

35 
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50 
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10 

MPSIK.LNSGY 

20 

DMPAVGFG CW 

30 

KVDVDTCSEQ 

40 

IYRAIKTGYR 

50 

LFDGAEDYAN 

60 

EKLVGAGVKK 

70 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHP 'DNVEK 

100 

ALNRTLS DLQ 

110 

VDYVDLFLIH 

120 

FPV TFKFVPL 

130 

EEK.YPPGFYC 

140 

GKGDNFDYED 
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150 

VP ILETWKAL 

160 

EKLVXAGKIR 

170 

SI GVSNFPGA 

180 

LLLDLLRGAT 

190 

IKPSVLQVEH 

'■ 200 
HPYLQQPRLI 

210 

EFAQSRGIAV 

220 

TAYSSFGPQS 

/ 230 
FVELNQGRAL 



240 

NTSPLFENET 

250 

IKAIAAKHGK 

260 

SPAQVLLRWS 

270 

SQRGIAIIPK 

280 

SNTVPRLLEN 
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290 

KDVNSPD.LDE 

300 

QDFADIAKLD 

310 

INLR FNDPWD 



WDKIPI'FV* 



dans laquelle ladite sequence d'ADN est capable d'exprimer ledit polypeptide dans un micro-organisme. 

Sequence d'ADN selon la revendication 1 , caracterisee en ce que ladite sequence d'ADN comprend en outre un 
gene de structure codant pour la dehydrogenase du xylitol ayant la sequence suivante d'acides amines : 
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10 

MTANPSLVLN 

5 

20 

KID DISFETY 

» .30 

DAPEISEPTD 

40 

is VLVQVKKTGI 

50 

CGSDIHFYAH 



20 



25 



30 



35 



40 



45 



SO 



55 



60 

GRIGNFVLTK 

70 

PMVLGHESAG 

/ 80 
TVVQVGKGVT 

90 

SLKVGD'NV'AI 

100 

EPGIPSHFSD 

110 

EY 'KSGHYNLC 

120 

P H .M A F A A T P N 

130 

SKEGEPNPPG 
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140 

TLCKYFKSPE 

DFLVKLPDH V 

160 

SLELGALVEP 

170 

LSVGVHASKL 

180 

GSVAFG DYVA 

190 

VFGAGPVGLL 

200 

AAAVAKTFGA 

* 210 
KGVIVVDIFD 

220 

NKLKMAKD.IG 

220 

A A T K T F i; S K T 



240 

GGSEELIKAF 

250 

GGNVPNVVLE 

260 

CTGAEPCIKL 
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270 

GVDAIAPGGR 

280 

FVQ .VGNAAGP 

290 

VSFPITVFAM 



300 

KELTLFGSFR 



310 

YGFNDYKTAV 



320 

GIFDTNYQNG 

330 

RENAPIDFEQ 

/ 340 
LITHRYKFKD 



350 

AIEAYD LVRA 

• • » 

360 

GKGAVKCLID 



G P E * 



Sequence d'ADN selon les revendications 1 ou 2. caractSrisSe- en ce que ladite sequence d'ADN est d6rrv6e 
d'une levure, de preference d'une levure choisie parmi un groupe constitue* des genres Schwanniomyces, Sac- 
charomyces, Kluyveromyces, Pichia, Hansenula. Candida, Debaryomyces, Metschnikowia, et Pachysolen. 

Sequence d'ADN selon la revendication 3, caracterisee en ce que la levure est Pichia stiprtis. de preference Pichia 
stipitis CBS 5773 (DSM 5855). 

Sequence d'ADN selon la revendication 1, comprenant la sequence suivante de nucleotides: 
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. -350 

GGATCCACAGACACTAATTGGTTCTA 
-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 -V 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-30 

GAGAATTTTTCAG77TTCTTTTCATTTCTC 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 

AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

17 0 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
G CCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

• * 

310 ' ! 330 

GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430-- 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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490 510 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 570 
AT C AAG C CATCTGT CTTG CAAGTTG AA CAC 

590 

CACCCATACTTGCAACAACCAAGATTGATC 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

730 * 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

79C sio 
TCCCAAAGAGGCAriGCCATCATTCCAAAG 

830 

TCCAACACTGTCCCAAGATTGTTGGAAAAC 

8.50 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
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910 930 

ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 mo 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 H7 0 

AAACATATCTTAAATTGTAGAAAACTATAA 

* • » 

1190 

A CTTTT C AATT C AAAC C ATGT CTG C C AAG G 

1210 1230 
AA T C AAA 7 G A G A T T 7 7 7 7 T C G C A G C C AAA C 

1250 

TTG AAT C C AAAAAT AAAAAA C G TC ATTGT C 

1270 1290 
TG AAA C AACTCT AT CTT AT CTTT C AC CT C A 

1310 

TCAATT CATTGCATATCATAAAAG CCTCCG 
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1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

5 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 



IS 



20 



25 



35 



40 



45 



SO 



1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CT A CGTT C AAAT AG A CT C CAATACAG CTG G 

1550 

T CT G C C C AAGTT G AGC AACTTT AATTT AG A 

1570 1590 
GACAAAGT CGTCTCTGTTGATGTAGGCAC C 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

163C 1650 
TCTGGAGTOGAAACATCTCCAGTTGTCAAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATC 



6. Sequence d'ADN selon la revendication 2, comprenant la sequence suivante de nucleotides : 

ss 
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-310 -290 

TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250- -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -no 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 
TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTT'TGTTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

HO 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 

290 ? 310 

CCATCCAGATTCTCCGACGAATACAAGAGC 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 - 43 0 
TACIT CAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTT G C CAG AC CAC GT CAG CTTGG AACTC 
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470 490 
GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 670 
ATGGCCAAC-GACATTGGTGCTGCTACTCAC 

** 690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 

710 730 
GAATTGATCAAGGCTTTCGGTGGTAACGTG 

750 

770 790 
GAAC C7TGT AT CAAGTTGGGTGTTG ACGC C 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 9io 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

930 

GACTA CAAGACTGCTGTTGGAAT CTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 10 3 0 

AGATACAAGTTCAAGGACGCTATTGAAGCC 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 / 1090 

GTCAAGTGTCTCATTGACGGC CCTGAGTAA 

1110 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 1130 
AG AAA C G AAAATG A IT.-. T C AAAT A.G CTTT A 

1170 

TA GA C CTTTAT C G AAATTT ATGT AAA CT AA 

1190 12 lo 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 
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1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

1310 1330 
T CT GG AG ATTTTG GTAATTG A CAG AGAA C C 

1350 

C CTTT CTGCTATTGC CAT CTAAACATC CTT 

1370 139 0 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 i 510 
TCATAGTTCTTAGAGACACCAACTATACTT 

1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
AA.GCATTCAC-CAAGCTTCCCCAGAAGTTGC 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAC-AC 

1610 1620 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 

Sequence d'ADN selon I'une quelconque des revendications 1 a 6, caracterisee en ce qu'elle est obtenue par la 
technology de PADN recombinant, a partir d'ADN naturel et/ou d*ADNc et/ou d'ADN synthetise chimiquement. 

Combinaison de sequences d'ADN. caracterisee en ce que ladite combinaison comprend une premiere sequence 
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SJuEh w' Une qU H e ' COn r des revend ^tions 1 ft 7 et une ou plusieurs autres sequences d'ADN capable de 
reguler I express.cn d'un gene de structure encode par ladite sequence d'ADN dans un microorganisms 

9 " dpTm^r 50 S ! quences d ' ADN selon ,a "verification 3. caracterisee en ce que ladite combinaison comprend 

SSSST? h eqU . enCeS C ° nSerVam ,6Ur ***** * 6Xprimer une e **™ fonctionnelle ayant une 
actrvite de reductase du xylose ou de dehydrogenase du xylitol. 

10. Combinaison de sequences d'ADN selon la Revendicaticn 8 ou 9, caracterisee en ce que ledit gene de structure 
cont.en, des sequences d'ADN derivees du gene de structure codant pour la reductase du xylie ou ,a dJshSo! 
genase du xyhto. qui modifie ledit produi. de proteine tout en conservan, ses tonctions d'une fa^tl que^ 
produ-t de prote.ne est exprime comme un produit de gene ayant une activite enzymatique 

11. Combinaison de sequences d'ADN selon I'une quelconque des revendications 8 a 10, caracterisee en ce cue 

a 3 .' srr capabies de r ' 9uier ,,expression dudrt 9&ne da - ^t^:z: 

note, sont derivees dudrt micro-organisme hfite. 

12. Combinaison selon la revendication 11, caracterise en ce que lesdites sequences d'ADN capabies de reauler 
I expression sont des promoters susceptibles d'etre induits. capaoies ae reguler 

13 ' StT^STJ TV* feVendiCa ! ion 12 ' caracterisee en ce que lesdites sequences d'ADN capab.es de reguler 
I expression sont choisies parmi les promoteurs suivants ■ 
ADH1. ADH2, PDC, GAL1/10. 

U ' ca^L^TT™ qUelC °" que des "indication, 114 13, caracterisee en ce que ladite sequence d'ADN 

teTotine SI ^TT- Z * "» Pr0m0XeUf COnduisant * une ^expression de 

la proteine encodee par ledtt gene de structure. 

1S " cSi'iT^*" 8 * e " C9 K qU8 ' edi ' vecteurcom P rend u ™ «*l«ence d'ADN selon .'une quelconque des revendi- 
cations 1 a 7 ou une comb.na,son de sequences d'ADN selon I'une quelconque des revendications 8 a 14. 

16 ' ^Z! 6to 2! a r ^ end ' cation 15 ' caracterise en ce que ledit vecteur est choisi parmi le groups comprenant les 
plasm,des pR1 , p R2 , P D1 , pD2, pRDI . pXRa, pXRb, pXDH, pXR. pXDH-HIS3, pXR-LEuf. 

1 ? ' w'VSS^' Ca ; aCt6r f en " que ,edit ™ro-organisme est capable d'exprimer une reductase du xylose ou 
les sequences r deS * ydr °9 6nase du ^ a P^ «* '*cu des sequences d'ADN covenant 

8 °h qU8lConque des ^endications 1 a 7 ou une combinaison de sequences d'ADN 

xvSJ i h ^indications 8 a 14. codant pour ladite reductase du xylose ou ladite reductase du 
xylose et ladite deshydrogenase du xylitol, par la technotogie de I'ADN recombinant 

18 ' cl^TSj'l™ Se 'T ' a revendication 17 ' cauterise en ce que ledit micro-organisme est choisi parmi un groupe 
STSSiSTT £ 9 nT Schizosaccharomyces. Schwann bmyces. K.uyveromyces. Z 

Zymo^as Debaryomyces, Me.schnikowia. PachysolenouPaecilomycesoudebacteriesdu genre 

19 ' rev'isS 93 "' 5 " 19 S9l0n 13 r6VendiCa,i0n 1 8 ' caracterise en ce que ledit microorganisme est Saccharomyces ce- 

20 ' plmb^ r9aniSme Se '° n 13 reV6ndica,ion 18 ' caracterise en ce que ledit mcro-organisme est Schizosaccharomyces 

21 " ou C |^ a h niSme Se J° n i : unec ' uelconc 1 ue des revendications 17a 20. caracterise en ce que ladite sequenced'ADN 
ou la comb.na,son de sequences d'ADN est integree dans le genome dudit mfcroorganiame. 

22 esX^TH^ *tT I'"" 6 \ qUe,COnque des ^vendications 17 a 21. caracterise en ce que led! microorganisme 
est utile dans la production de biomasse. dans I'industrie alimentaire et dans les precedes de fermentation. 

23 .aJcId^ 2Z e " C9 qU9 "» Sanisme est utile pour ,a fermen- 
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24. Procede de production de la reductase du xylose ou de la reductase du xylose et de la dehydrogenase du xvlitol 
par culture d'un micro-organ isme selon Pune quelconque des revendications 17 a 21 sous des conditions aonrcv 
pnees et recuperation du(des)dits enzyme(s) d'une maniere connue en soi. 

25. Procede selon la revendication 24, caracterise en ce que ledit micro^rganisme est choisi pour la fermentation 
efficace du xylulose. UM 

2$ ' ' a rev r diCati ° n 24 ° U 25 ' caract6ris6 ™ ce que ledit micrc^rganisme a recu lesdites sequences 

cPADN ou ladrte combmaison de sequences d'ADN par transformation utflisant un vecteur, ledit vecteur etant de 
preference un fragment d'ADN ou un plasmide. 

27 ' r^M? Se, ° n ' a revendication 26 ' caracterise en ce que ledit vecteur contient de I'ADN, qui est homologue a 
I ADN dudit micro-organisme, conduisant a Integration dans le genome dudit micronorganisme. 

28. Procede de fabrication Methanol, caracterise en ce qu'on utilise un micrc^rganisme selon I'une quelconque des 
revendications 1 7 a 23. ^ 

29. Procede selon la revendication 28, caracterise en ce que le procede de fermentation est adapte a la production 
de boissons alcoolisees ou tfune proteine de cellule unique produite a partir de substrats contenant du xylose 
libre, de preference libere par I'actrvite de la xylanase et/ou de la xylosidase. 

30. Procede pour la production de biomasse, caracterise en ce que I'on utilise !e microorganisme note selon I'une 
quelconque des revendications 17 a 23. 



20 



25 



Revendications pour I'Etat contractant suivant : ES 

1. Procede de preparation d'une sequence d'ADN, cette sequence d'ADN covenant un gene de structure codant 
pour une reductase du xylose ayant la sequence suivante d'acides amines : 



35 



40 



45 



50 



55 
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10 

M PSIXLNSGY 

20 

DMPAVGFGCW 

30 

KVDVDTCSEQ 

40 

IYRAI-KTGYR 

50 

LFDGAEDYAN 

60 

EKLVGAGVKK 

70 

AIDEGIVKRE 

80 

DLFLTSKLWN 

90 

NYHHPDNVEK 

■i ' 

v ' 100 
ALNRTLSDLQ 

110 

VDYVDLFLIH 



45 « „ 120 

^P.VTFKFVPL 

EEKYPPgf y 13 C 
~ „ 140 

gkgdnfdyed 

55 



75 
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30 



35 



40 
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ISO 

VP'ILETWKAL 

160 

EXLVXAGKI R 

170 

SIGVSNFPGA 

180 

LLLDLLRGAT 

190 

IKPSVLQVEH 

200 

HPY. LQQPRLI 

210 

EFAQSRGI AV 

220 

TAYSSFGPQS 

230 

FVELNQGRA.L 

240 

NTSPLFENET 

250 

IKAXAAKHG K 

260 

SPA-QV. LLRWS 

270 

SQRGIAI I. PK 

280 

SNTVPRLLEN 
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KDVNSFD.L D^E 



300 
L D 



INLRFNDP w 31 D 

^ani Sme . danS , eq ue, tedfte 

ou cfADN S ynth6tis 6 chimiquement teChn0l ° 9, ° de ' «*°"*'nant. a partir cfADN naturel et/ou d'ADNc et/ 



111 



EP 0 450 430 B1 



10 

MTANPSLVLN 

s 

KIDDISFET Y 

» -30 

DAPEISEPTD 

40 

is vlvqvkxtg I 

50 

CGSDIHFYAH 

„ « 60 

GRIGNFVLTK 

2s 70 

PMVLGHESAG 



30 



35 



40 



45 



50 



80 

TVVQVGKGVT 



90 

SLKVGDNV.AI 



100 

2 P C- I P S ?. F S D 



110 

EY-KSGHYNLC 

120 

PHMAFAATPN 

13 0 

SKEGEPNPPG 



55 
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20 



25 



30 



35 



40 



45 



SO 



tlckyfks p 14 e 
slelgalv e 16 p 

L S V G V H A S K 1? L 

GSVAFGDY V^A 

V F G A G P V G L 19 L 

A A A V A K T F G 2 °A 

KGVIVVDI F 21 S 

220 



N K L K M A .K D, 
A A T H T F K S K T 



I G 
220 



240 



GGSEE LIKAF 



GGNVPnvv 



250 
L E 



260 



CTGAEPC IKL 



55 
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270 

GVDAIAPGG R 

280 : 

FVQVGNAAGP 

290 

VSFPITVFAM 

300 

KELTLFGSFR 

310 

YGFNDYKTAV 

320 

GIFDTNYQNG 

330 

RENAPIDFEQ 

* 3 40 

LITHRYKFKD 

350 

AIEAYDLVRA 

360 

C- K G A V K C L I D 



G P E * 

^SSS^T"^ 3 ' CaraC,6n ' S6 ^ 06 qUS ^ ' eVUre 6St Pfchia * P^rence Rchia stip« fe 

Precede se.cn .a revendication 1 , dans leque. t sequence cfADN comprend sequence suh,an,e de nucleotides : 
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-350 

GGATCCACAGACACTAATTGGTTCTA 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

■ » 

. ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAC-rTTTCTTTTCATTTCTC 

cagtattcttttctatac^2tatactaca 

10 

ATGCCTTCTATTAAGtTGAACTCTGGTTAC 

50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
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70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 ISO 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 ' 330 

GTTGACTACGTTGACTTGTTCTTGATCCAC 

♦ * » 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 39Q 
GAAGAAAAGTACCCACCAC-GATTCTACTGT 

410 

GGTAAGGC-TGACAACTTCGACTACGAAGAT 

430- 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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490 

TCTATCGGTGTTTCTAACTTCCCAGGTGCT 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 

550 - 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 

CACCCATACTTGCAACAACCAAGATTGATC 
610 

GAATTCGCTCAATCCCGTGGTATTGCTGTC 

650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT • 



670 

TTCGTTGAATTGAACCAAGGTAGAG 



690 
CTTTG 



710 

AACACTTCTCCATTGTTCGAGAACGAAACT 

73 0 •> 7 - 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 

♦ ■ » 

"770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 

790 810 

AG 



•i CCCAAAGAGGCAITGCCATCATTCCAAi 



830 

x CCAACACTGTCCCAAGATTGTTGGAAAAC 
850 8?0 
AAGGACGTCAACAGCTTCGACTTGGACGAA 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
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910 g30 
ATCAACTTGAGATTCAACGACCCATGGGAC 

950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 

GGTTGCTTTATAGAGAGGAAATAAAACCTA 

ATATACATTGATTGTACATTTAAAATTGAA 
1030 - n . n 
TATTGTAGCTAGCAGATTCGGAAATTTAAA 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 niift 
TCTCTATGTACATACACGTTGAAGATAGCA 

GTAC^GTAGACATCAAGTCTACAGATCATT 
1150 

AAACATATCTTAAATfGTAGAAAACTATAA 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 
1210 

AATCAAATGAGATTTT^TCGCAGCCAAAC 

ttgaatccaaa«\ataaa\Laacgtcattgtc 

TGAAACA^CTCTATCTTATCTTTCACCTCA 
TCAATTCATTGCATATCATAAAAGCCTCCG 
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ArAGCATACAAAACTACTTCTGCATCATAr 

CTAAATCATAGTGCCATATTCAGTAACAAT 

ACCGGTA^GA^CTTCTATTTTTTTAGTCT 

GCCTTAACGAGATGCAGATCGATGCAACGT 
1450 

AAGArCAAACCCCTCCAGTTGTACAGTCAG 

TCATATAGTGAACACCG^CAATATG^TAT 
1510 

CTACGTTCAAATAGACTCCAATACAGCTGG 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 
1570 

«CAAAGTCGTCTCTG raGATGTAGG ^0 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

1650 
TGTCAAA 

TATCAA^CACTGACC.4Jc°TC^^ 

* ,v,v - v - i TCAACTGGTA 

1690 

GAAGATTXCGTTTTCGGGATC 



1630 

TCTGGAGTGGA :: C ^-Trr> ^ 1650 



* r8VendiCa,i0n 2 - *■«- ,a sequence CADN com 

d ADN comprend la sequence suivante de nucleotides : 
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TCTAGACCACCCTAAGTCGTCCCTATGTCG 

TATGTTTGCCTCTACTACAAAGTTACTAGC 
-250 

"inn* % « 

T <3CCCTCT 

-210 



AAATATCCGCAGCAACAACAGC- " 23 ° 



TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

cgctttcIggctccagcttctgtcctSc 

GGCTGCTGCACATAACGCGGGGACAATGAC 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

ctccxccaggtgaaaaa"^acgaxcaacxt 

ITACTCTTTTCCATTGTCTCTTGTGTATAC 



TCACTTTAGTTTGTTTCAATCACCCCTAAT 
ACTCTTCACAC^ATTAAAATGACTGCTAAC 
CCTTCCTTGGTGTTGAACAAGATCGACGAC 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
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70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 

130 ISO 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 

350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
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90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 

■ ISO 

ATCCACTTCTACGCCCATGGTAGAATCGGT 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 

270 

GGTG ACAACGTCGCTATCG AAC CAGGT ATT 

290 * 310 

CCATCCAGATTCTCCGACGAATACAAGAGC 

330- 

GGTCACTACAACTTGTGTCCTCACATGGCC 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 

410 .. 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
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470 490 

GGTG CT CTTGTTGAGCCATTGTCTGTTGGT 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 

570 

GGTCCTGTTGGTCTTTTGGCTGCTGCTGTC 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 

630 ' 

GTCGTTGACATTTTCGACAACAAGTTGAAG 

650 . 670 
ATGGCCAAGGACATTGGTGCTGCTACTCAC 

? 690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 

710 *' 730 

G AATTG ATCAA G G CTTT CGGTG GTAACGT G 

750 

770 790 
GAACCTTGTA.TCAAGTTGGGTGTTGACGCC 

810 

ATTGCCCCAGGTGGTCGTTTCGTTCAAGTT 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
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870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 

930 

GACTACAAGACTGCTGTTGGAATCTTTGAC 

950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 

1010 1030 
AGATA CAAGTTCAAG GA CGCTATTGAAGC C 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 

1070 •? 1090 

GTCAAGTGTCTCATTGACGGCCCTGAGTAA 

'lllO 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 
1130 1150 

1170 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

G CAT CA CGTG AGTTT CTT G AATT CTTG AAA 
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1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 

1310 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

» 

1370 1390 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACA<PCAAQTATACTT 

,1530 

AGCCAAAC\AAATGTCCTTGGCCTCTAAAG 
1550 1570 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1620 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 

CTATAGTGGATCC 



Precede de preparation d'une combinaison de sequences d'ADN. ledit precede comprenant la combinaison d'une 
premiere sequence cfADN susceptible d'etre obtenue selon i'une quelconque des revendications 1 a 6 et une ou 
plusieurs autres sequences d'ADN capables de reguler rexpression d'un gene d structure encode par ladit 
sequence d'ADN dans un micro-organisme note d'une maniere connue en soi. 
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8. Procede selon la revendication 7, dans tequel ladite combinaison de sequences comprend des modifications des 
sequences d'ADN conservant leur capacity a exprimer una enzyme fonctionnelle ayant une activite de reductase 
du xylose ou de dehydrogenase du xylitol. 

5 9. Procede selon Tune quelconque des revendications 7 ou 9, dans lequel tedit gene de structure contient des se- 
quences d'ADN derivees du gene de structure codant pour la reductase du xylose ou la dehydrogenase du xylitol 
qui modifie ledit produit de proteine tout en conservant ses fonctions d'une facon telle que ledit produit de proteine 
est exprime comme un produit de gene ayant une activite enzymatique. 

10 10. Procede selon Tune quelconque des revendications 7 a 9, dans lequel lesdites sequences d'ADN capables de 
reguler ('expression dudit gene de structure dans un micro-organisme note, sont ddrivees dudit micro-organisme 
note. 

11. Procede selon la revendication 10, dans lequel lesdites sequences d'ADN capables de reguler I'expression sont 
des promoteurs susceptibles d'etre induits. 

12. Procede selon la revendication 1 1 , caracterise en ce que lesdites sequences d'ADN capables de reguler I'expres- 
sion sont choisies parmi les promoteurs suivants : . 

ADH1 , ADH2, PDC, GAL1/10. 

20 

13. Procdde selon I' une quelconque des revendications 10 a 12, dans lequel ladite sequence d'ADN capable de reguler 
Pexpression dudit gene de structure, est un promoteurfort, conduisanta une surexpresston de la proteine encodee 
par ledit gene de structure. 

2S 14. Procede de preparation d'un vecteur, ledit procede comprenant ('insertion d'une sequence d'ADN susceptible d'etre 
obtenue selon I'une quelconque des revendications 1 a 6 ou d'une combinaison de sequences d'ADN susceptibles 
d'etre obtenues selon I'une quelconque des revendications 7 a 13, dans un plasmide hdte. 

15. Procede selon la revendication 14, caracterise en ce qu'il produit un vecteur choisi parmi le groupe comprenant 
30 les plasmides pR1 , pR2, pD1 , pD2, pRD1 , pXRa, pXRb, pXDH, pXR, pXDH-HIS3, pXR-LEU2. 

16. Procede de preparation d'un micro-organisme capable d'exprimer une reductase du xylose ou une reductase du 
xylose et une deshydrogenase du xylitol, dans lequel des sequences d'ADN comprenant les sequences d'ADN 
susceptibles d'etre obtenues selon I'une quelconque des revendications 1 a 6 ou une combinaison de sequences 

35 d'ADN susceptibles d'etre obtenues selon I'une quelconque des revendications 7 a 1 3, codant pour ladite reductase 

du xylose ou ladite reductase du xylose et ladite deshydrogenase du xylitol, sont introduces dans un micro-orga- 
nisme hote. 

17. Procede selon la revendication 16, caracterise en ce que ledit micro-organisme hdte est choisi parmi un groupe 
<o constitue de levures des genres Saccharomyces, Schizosaccharomyces, Schwann iomyces, Kluyveromyces, Pi- 

chia, Hansenula, Candida, Debaryomyces, Metschnikowia, Pachysolenou Paecilomyces ou de bacteriesdu genre 
Zymomonas. 



45 



so 



18. Procede selon la revendication 17, caracterise en ce que ledit micro-organisme est Saccharomyces cerevisiae. 

19. Procede selon la revendication 1 7, caracterise en ce que ledit micro-organisme est Schizosaccharomyces pombe. 

20. Procede selon l*une quelconque des revendications 16 a 19, caracterise en ce que ladite sequence d'ADN ou la 
combinaison de sequences d'ADN est integr6e dans le g6nome dudit micro-organisme. 

21. Procede selon I'une quelconque des revendications 17 a 21, caracterise en ce que ledit micro-organisme est utile 
dans la production de biomasse, dans I'industrie alimentaire et dans les precedes de fermentation. 

22. Procede selon la revendication 21, caracterise en ce que ledit micro-organisme est utile pour la fermentation du 
xylose dans I'ethanol. 

23. Procede de production de la reductase du xylose ou de la reductase du xylose et de la deshydrogenase du xylitol 
par culture d'un micro-organisme susceptible d'etre obtenu selon I'une quelconque des revendications 16 a 20 
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sous des conditions appropri6es et recuperation du(des)d*rts enzyme(s) d'une maniere connue en soi. 

24. Prcc6d6 selon la revendication 23, caracterise en ce que ledit micro-organ isme est choisi pour la fermentation 
efficace du xylulose. 

5 

25. Proc6d6 selon la revendication 23 ou 24, caracterise en ce que ledit micro-organ isme a recu lesdites sequences 
d'ADN ou ladite combinaison de sequences d'ADN par transformation utilisant un vecteur, ledit vecteur etant de 
preference un fragment d'ADN ou un plasmide. 

10 26. Precede selon la revendication 25, caracterise en ce que ledit vecteur contient de PADN, qui est homologue a 
I'ADIM dudit micro-organism e, conduisant a ('integration dans le genome dudit micrc-organisme. 

27. Precede de fabrication d'ethanol, caracterise en ce qu'on utilise un micro-organisme selon Tune quelconque des 
revendications 1 6 a 22. 

15 

28. Precede selon la revendication 27, caracterise en ce que le precede de fermentation est adapte a la production 
de boissons alcoolisees ou d'une proteine de cellule unique produite a partir de substrats contenant du xylose 
libra, de preference libere par I'activite de la xytanase et/ou de ta xylosidase. 

20 29. Precede pour la production de biomasse, caracterise en ce que Ton utilise le micro-organisme hdte selon Tune 
quelconque des revendications 16 a 22. 
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Fig.2A (1) 

-350 

GGATCCACAGACACTAATTGGTTCTA 

-310 

CATTATTCGTGTTCAGACACAAACCCCAGC 
-290 

GTTGGCGGTTTCTGTCTGCGTTCCTCCAGC 

-250 

ACCTTCTTGCTCAACCCCAGAAGGTGCACA 
-230 

CTGCAGACACACATACATACGAGAACCTGG 

-190 

AACAAATATCGGTGTCGGTGACCGAAATGT 
-170 

GCAAACCCAGACACGACTAATAAACCTGGC 

-130 

AGCTCCAATACCGCCGACAACAGGTGAGGT 
-110 

GACCGATGGGGTGCCAATTAATGTCTGAAA 

-70 

ATTGGGGTATATAAATATGGCGATTCTCCG 
-50 

GAGAATTTTTCAGTTTTCTTTTCATTTCTC 

-10 

CAGTATTCTTTTCTATACAACTATACTACA 

10 30 
ATGCCTTCTATTAAGTTGAACTCTGGTTAC 
MPS IKLNSGY 
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Fig.2A (2) 



50 

GACATGCCAGCCGTCGGTTTCGGCTGTTGG 
DMPAVGFGCW 

70 90 
AAAGTCGACGTCGACACCTGTTCTGAACAG 
KVDVDTCSEQ 

110 

ATCTACCGTGCTATCAAGACCGGTTACAGA 
IYRAI KTGYR 

130 150 
TTGTTCGACGGTGCCGAAGATTACGCCAAC 
LFDGAEDYAN 

170 

GAAAAGTTAGTTGGTGCCGGTGTCAAGAAG 
EKLVGAGVKK 

190 210 
GCCATTGACGAAGGTATCGTCAAGCGTGAA 
AIDEGIVKRE 

230 

GACTTGTTCCTTACCTCCAAGTTGTGGAAC 
DLFLTSKLWN 

250 270 
AACTACCACCACCCAGACAACGTCGAAAAG 
NYHHPDNVEK 

290 

GCCTTGAACAGAACCCTTTCTGACTTGCAA 
ALNRTLSDLQ 

310 330 
GTTGACTACGTTGACTTGTTCTTGATCCAC 
VDYVDLFLIH 
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Fig.2A (3) 



350 

TTCCCAGTCACCTTCAAGTTCGTTCCATTA 
FPVTFKFV PL 

370 390 
GAAGAAAAGTACCCACCAGGATTCTACTGT 
EEKYPPGFYC 

410 

GGTAAGGGTGACAACTTCGACTACGAAGAT 
GKGDNFDY ED 

430 450 
GTTCCAATTTTAGAGACCTGGAAGGCTCTT 
VPILETWKAL 

470 

GAAAAGTTGGTCAAGGCCGGTAAGATCAGA 
EKLV KAGKIR 

490 510 
TCTATCGGTGTTTCTAACTTCCCAGGTGCT 
SIGVSNFPGA 

530 

TTGCTCTTGGACTTGTTGAGAGGTGCTACC 
LLLDLLRGAT 

550 570 
ATCAAGCCATCTGTCTTGCAAGTTGAACAC 
IKPSVLQVEH 

590 

CACCCATACTTGCAACAACCAAGATTGATC 
HPYLQQPRLI 

610 630 
GAATTCGCTCAATCCCGTGGTATTGCTGTC 
EF AQSRGIAV 
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Fig.2A (4) 



650 

ACCGCTTACTCTTCGTTCGGTCCTCAATCT 
TAYSSFGPQS 

670 690 
TTCGTTGAATTGAACCAAGGTAGAGCTTTG 
FVELNQ GRAL 

710 

AACACTTCTCCATTGTTCGAGAACGAAACT 
NTSPLFENET 

730 750 
ATCAAGGCTATCGCTGCTAAGCACGGTAAG 
IKAIAAKHGK 

770 

TCTCCAGCTCAAGTCTTGTTGAGATGGTCT 
S PAQVLLRWS 

790 810 
TCCCAAAGAGGCATTGCCATCATTCCAAAG 
SQRGIAIIPK 

830 

TCCAACACTGTCCCAAGATTGTTGGAAAAC 
S NTVPRLLEN 

850 870 
AAGGACGTCAACAGCTTCGACTTGGACGAA 
KDVNSFD LDE 

890 

CAAGATTTCGCTGACATTGCCAAGTTGGAC 
QDFADIAKLD 

910 930 
ATCAACTTGAGATTCAACGACCCATGGGAC 
INLRFNDPWD 
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Fig.2A (5) 



950 

TGGGACAAGATTCCTATCTTCGTCTAAGAA 
WDKIPIFV* 

970 990 
GGTTGCTTTATAGAGAGGAAATAAAACCTA 

1010 

ATATACATTGATTGTACATTTAAAATTGAA 

1030 1050 
TATTGTAGCTAG CAG ATTCGG AAATTTAAA 

1070 

ATGGGAAGGTGATTCTATCCGTACGAATGA 

1090 1110 
TCTCTATGTACATACACGTTGAAGATAGCA 

1130 

GTACAGTAGACATCAAGTCTACAGATCATT 

1150 1170 
AAACATATCTTAAATTGTAGAAAACTATAA 

1190 

ACTTTTCAATTCAAACCATGTCTGCCAAGG 

1210 1230 
AATCAAATGAGATTTTTTTCGCAGCCAAAC 

1250 

TTGAATCCAAAAATAAAAAACGTCATTGTC 

1270 1290 
TGAAACAACTCTATCTTATCTTTCACCTCA 

1310 

TCAATTCATTGCATATCATAAAAGCCTCCG 
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Fig.2A (6) 



1330 1350 
ATAGCATACAAAACTACTTCTGCATCATAT 

1370 

CTAAATCATAGTGCCATATTCAGTAACAAT 

1390 1410 
ACCGGTAAGAAACTTCTATTTTTTTAGTCT 

1430 

GCCTTAACGAGATGCAGATCGATGCAACGT 

1450 1470 
AAGATCAAACCCCTCCAGTTGTACAGTCAG 

1490 

TCATATAGTGAACACCGTACAATATGGTAT 

1510 1530 
CTACGTTCAAATAGACTCCAATACAGCTGG 

1550 

TCTGCCCAAGTTGAGCAACTTTAATTTAGA 

1570 1590 
GACAAAGTCGTCTCTGTTGATGTAGGCACC 

1610 

ACACATTCTTCTCTTGCCCGTGAACTCTGT 

1630 1650 
TCTGGAGTGGAAACATCTCCAGTTGTCAAA 

1670 

TATCAAACACTGACCAGGCTTCAACTGGTA 
1690 

GAAGATTTCGTTTTCGGGATCC 
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2B < 1 > ,,n 

-310 -290 

TCTAGACCACCCTAAGTCGTCCCTATGTCG 

-270 

TATGTTTGCCTCTACTACAAAGTTACTAGC 

-250 -230 
AAATATCCGCAGCAACAACAGCTGCCCTCT 

-210 

TCCAGCTTCTTAGTGTGTTGGCCGAAAAGG 

-190 -170 
CGCTTTCGGGCTCCAGCTTCTGTCCTCTGC 

-150 

GGCTGCTGCACATAACGCGGGGACAATGAC 

-130 -no 
TTCTCCAGCTTTTATTATAAAAGGAGCCAT 

-90 

CTCCTCCAGGTGAAAAATTACGATCAACTT 

-70 -50 
TTACTCTTTTCCATTGTCTCTTGTGTATAC 

-30 

TCACTTTAGTTTGTTTCAATCACCCCTAAT 

-10 10 
ACTCTTCACACAATTAAAATGACTGCTAAC 

M T A N 

30 

CCTTCCTTGGTGTTGAACAAGATCGACGAC 
PSLVLNKIDD' 
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Fig.2B (2) 

50 70 
ATTTCGTTCGAAACTTACGATGCCCCAGAA 
ISFETYDAPE 

90 

ATCTCTGAACCTACCGATGTCCTCGTCCAG 
ISEPTDVLVQ 

110 130 
GTCAAGAAAACCGGTATCTGTGGTTCCGAC 
VKKTGICGSD 

150 

ATCCACTTCTACGCCCATGGTAGAATCGGT 
IHFYAHGRIG 

170 190 
AACTTCGTTTTGACCAAGCCAATGGTCTTG 
NFVLTKPMVL 

210 

GGTCACGAATCCGCCGGTACTGTTGTCCAG 
GHESAGTVVQ 

230 250 
GTTGGTAAGGGTGTCACCTCTCTTAAGGTT 
VGKGVTSLKV 

270 

GGTGACAACGTCGCTATCGAACCAGGTATT 
GDNVAIEPGI 

290 310 
CCATCCAGATTCTCCGACGAATACAAGAGC 
PSRFSDEYKS 

330 

GGTCACTACAACTTGTGTCCTCACATGGCC 
GHYNLCPHMA 
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Fig.2B (3) 

350 370 
TTCGCCGCTACTCCTAACTCCAAGGAAGGC 
FAATPNSKE G 

390 

GAACCAAACCCACCAGGTACCTTATGTAAG 
EPNPPGTLCK 

410 430 
TACTTCAAGTCGCCAGAAGACTTCTTGGTC 
YFKSPEDFLV 

450 

AAGTTGCCAGACCACGTCAGCTTGGAACTC 
KLPDH VSLEL 

470 490 
GGTGCTCTTGTTGAGCCATTGTCTGTTGGT 
GALVE PLSVG 

510 

GTCCACGCCTCCAAGTTGGGTTCCGTTGCT 
VHASKLGSVA 

530 550 
TTCGGCGACTACGTTGCCGTCTTTGGTGCT 
FGDYVAVFGA 

570 

GGT CCTGTTG GTCTTTTGG CTG CTGCTGT C 
GPVGLLAAAV 

590 610 
GCCAAGACCTTCGGTGCTAAGGGTGTCATC 
AKTFGAKGVI 

630 

GTCGTTGACATTTTCGACAACAAGTTGAAG 
VVDIFDNKLK 
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Fig.2B (4) 

650 570 

ATGGCCAAGGACATTGGTGCTGCTACTCAC 
MAKDIGAATH 

690 

ACCTTCAACTCCAAGACCGGTGGTTCTGAA 
TFNSKTGGSE 

710 730 

GAATTGATCAAGGCTTTCGGTGGTAACGTG 
ELIKAFGGNV 

750 

CCAAACGTCGTTTTGGAATGTACTGGTGCT 
PNVVLECTG A 

770 790 
GAACCTTGTATCAAGTTGGGTGTTGACGCC 
EPCIKLGVDA 

810 

ATTG CCCCAGGTGGTCGTTTCGTTCAAGTT 
IAPGGRFVQV 

830 850 
GGTAACGCTGCTGGTCCAGTCAGCTTCCCA 
GNAAGPVSFP 

870 

ATCACCGTTTTCGCCATGAAGGAATTGACT 
ITVFAMKELT 

890 910 
TTGTTCGGTTCTTTCAGATACGGATTCAAC 
LFGSFRYGFN 

930 

GACTACAAGACTGCTGTTGGAATCTTTGAC 
DYKTAVGI FD 
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Fig.2B (5) 



950 970 
ACTAACTACCAAAACGGTAGAGAAAATGCT 
TNYQNGRENA 

990 

CCAATTGACTTTGAACAATTGATCACCCAC 
PI DFEQLITH 

1010 1030 
AGATACAAGTTCAAGGACGCTATTGAAGCC 
RY KFKDAIEA 

1050 

TACGACTTGGTCAGAGCCGGTAAGGGTGCT 
YDLVRAGKGA 

1070 1090 
GTCAAGTGTCTCATTGACGGCCCTGAGTAA 
VKCLIDGPE* 

1110 

GTCAACCGCTTGGCTGGCCCAAAGTGAACC 

1130 1150 
AGAAACGAAAATGATTATCAAATAGCTTTA 

1170 

TAGACCTTTATCGAAATTTATGTAAACTAA 

1190 1210 
TAGAAAAGACAGTGTAGAAGTTATATGGTT 

1230 

GCATCACGTGAGTTTCTTGAATTCTTGAAA 

1250 1270 
GTGAAGTCTTGGTCGGAACAAACAAACAAA 

1290 

AAAATATTTTCAGCAAGAGTTGATTTCTTT 
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Fig.2B (6) 

1310 1330 
TCTGGAGATTTTGGTAATTGACAGAGAACC 

1350 

CCTTTCTGCTATTGCCATCTAAACATCCTT 

1370 1390 
GAATAGAACTTTACTGGATGGCCGCCTAGT 

1410 

GTTGAGTATATATTATCAACCAAAATCCTG 

1430 1450 
TATATAGTCTCTGAAAAATTTGACTATCCT 

1470 

AACTTAACAAAAGAGCACCATAATGCAAGC 

1490 1510 
TCATAGTTCTTAGAGACACCAACTATACTT 

1530 

AGCCAAACAAAATGTCCTTGGCCTCTAAAG 

1550 1570 
AAGCATTCAGCAAG CTTCCCCAGAAGTTGC 

1590 

ACAACTTCTTCATCAAGTTTACCCCCAGAC 

1610 1630 
CGTTTGCCGAATATTCGGAAAAGCCTTCGA 



CTATAGTGGATCC 
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